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UNC-5 constructs and screening methods 

The present invention is concerned with unc-S, a 
conserved animal gene family that encodes proteins 
5 implicated in directional cell behaviour. In 

particular, the invention is concerned with novel 
splice variants of the human unc-5C cDNA and a novel 
human unc-5HSl cDNA sequence. In addition, assays are 
provided based on protein-protein interactions between 
10 the UNC-5 protein and a variety of different 
interacting proteins. 

Unc~5 is a conserved animal gene family that 
encodes proteins implicated in directional cell 
behaviour. The unc-5 gene of the nematode worm 
15 Caenorhabditis elegans (C. elegans) is known to be 
involved in dorsal migration in contrast to unc~40 
which is involved in ventral migrations (Hedgecock et 
al.. Neuron Vol. 2; 61-85, 1990). Both the unc-5 and 
unc-40 genes are associated with the netrin and 
20 all three genes play a dominant role in directional 
neuronal outgrowth . 

The C. elegans unc-5 gene encodes a 919 amino 
acid transmembrane receptor with two immunoglobulin 
and two thrombospondin type I extracellular domains 
25 (Leung-Hagesteijn et al., Cell Vol. 71:289-299, 1992). 

Ectopic overexpression of unc-5 in the C. elegans 
touch neurons resulted in dorsal steering of these, 
instead of the normal ventral elongation of these 
neurons (Hamelin et al.. Nature, 364:327-330, 1993). 
30 Several vertebrate homologues of unc-5 have been 

cloned including the Rattus norvegicus uncSHl and 
unc5H2 (Leonardo et al., Nature Vol. 386:833-838, 
1997), a Mus musculus homologue designated rem 
(Ackerman et al. Nature Vol. 386:838-842, 1997) and a 
35 human homologue unc5C (Ackerman et al . , Genomics Vol. 
52:205-208, 1998) . 
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The intracellular part of the UNC-5 proteins 
contains a ZO-1 domain. Such domains are known to be 
involved in tight junction biology. Furthermore UNC-5 
proteins contain a dearh domain. So far this is the 
5 only proLein found in C. elegans that harbors such a 
death domain. Death domains are involved in the 
apoptotic process. In this process, caspases play an 
important role. The human UNC-40 homologue DCC, a 
protein also known involved in axonal outgrowth, is a 
10 caspase-3 substrate (Mehen et al.. Nature 395:801-804, 
1998) . 

The present inventors have identified three 
previously unknown variant unc-5C cDNAs. These 
variant cDNAs correspond to alternatively spliced unc- 
15 5C transcripts. 

Accordingly, in a first aspect provides a protein 
which comprises the amino acid sequence set forth in 
SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6 or an amino 
acid sequence which differs from that shown in SEQ ID 
20 NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6 only in 
conservative amino acid changes. 

Also provided by the invention are nucleic acid 
sequences which encode the proteins of the invention. 

Also provided by the invention are a nucleic acid 
25 comprising the sequences of nucleotides set forth in 
SEQ ID NO: 1, a nucleic acid comprising the sequences 
of nucleotides set forth in SEQ ID NO: 3 and a nucleic 
acid comprising the sequences of nucleotides set forth 
in SEQ ID NO: 5. 
30 The splice variants of human unc-5C were cloned 

by PGR technology. Two primers were developed to 
amplify the intracellular part of the unc-5C. Human 
Brain cDNA was used for this purpose. Three new splice 
variants of human unc-5C were characterized. A 
35 schematic representation of these splice variants is 
given in Figure 5. 

The first splice variant (designated unc-5Cb) has 
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a deletion of an intron in the UP region. The 
nucleotide sequence of a partial unc-5Cb cDNA is set 
forch in SEQ ID NO: 1 and the corresponding amino acid 
sequence is set forth in SEQ ID NO: 2. The splice of 
5 this intron results in a UNC-5Cb protein which is 

considerably shorter than the previously known UNC-5C, 
as the coding frame is not maintained. This protein is 
truncated for the DD domain and for the major part of 
the UP domain. 

10 The second splice variant (designated unc-5Cc) is 

deleted by an intron in the ZO-1 region, also 
resulting m a shorter protein than the previously 
knovm UNC-5C, as the coding frame is not maintained. 
The nucleotide sequence of a partial UNC-5Cc cDNA is 

15 set forth in SEQ ID NO: 3 and the corresponding amino 
acid sequence is shown in SEQ ID NO: 4. The resulting 
protein (UNC-5Cc) is truncated for the DD domain, the 
UP domain and a part of the ZO-1 domain. 

The third splice variant (unc-5C8) is deleted by 

20 a small intron in the ZO-1 domain, but the coding 
frame is maintained. This results in a slightly 
smaller protein (UNC-5C8), wherein only the amino acid 
sequence coded by the spliced intron is truncated. 
The nucleotide sequence of a partial UNC-5C8 cDNA is 

25 set forth in SEQ ID NO: 5 and the corresponding amino 
acid sequence is shown in SEQ ID NO : 6 . 

The presence of various splice variants of unc-5C 
in the human brain indicated that the activity of 
UNC-5C is tightly regulated. 

30 The inventors have also identified a human unc-5 

cDNA which shares homology with the Rattus norvegicus 
unc-5Hl cDNA. 

Accordingly, in a further aspect the invention 
provides a nucleic acid molecule comprising the 

35 sequence of nucleotides set forth in SEQ ID NO: 7. 

Whilst performing yeast two hybrid experiments to 
identify proteins which interact with the human UNC-5C 
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protein the inventors identified a number of 
heretofore unknown human cDNAs which encode proteins 
v;hich interact with human UNC-5C. 

Accordingly, the invention further provides 
a nucleic acid comprising the sequence of nucleotides 
set forth in SEQ ID NO: 56 and a sequence of 
nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 57, a nucleic acid 
comprising the sequence of nucleotides set forth in 
SI:q id NO: 54 and a sequence of nucleotides 
complementary to the sequence of nucleotides set forth 
^r. SEQ ID NO: 55, a nucleic acid comprising the 
sequence of nucleotides set forth in SEQ ID NO: 61 and 
a sequence of nucleotides complementary to the 
sequence of nucleotides set forth in SEQ ID NO: 62 
and a nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 63 and a sequence 
c: nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 64. 

The nucleic acid molecules according to the 
invention may, advantageously, be included in a 
suitable expression vector to express the proteins 
encoded therefrom in a suitable host. Incorporation 
of cloned DNA into a suitable expression vector for 
subsequent transformation of said cell and subsequent 
selection of the transformed cells is well known to 
tnose skilled in the art as provided in Sambrook et 
al. (1989), molecular cloning, a laboratory manual , 
Cold Spring Harbour Laboratory Press. 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 
the invention cperably linked to regulatory sequences, 
such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 
35 "operably linked'' refers to a juxtaposition wherein 
the components described are in a relationship 
permitting them to function in their intended manner. 
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Such vectors may be transformed into a suitable host 
cell to provide for expression of a protein according 
to the invention. Thus, in a further aspect, the 
invention provides a process for preparing proteins 
5 according to the invention which comprises cultivating 
a host cell, transformed or transfected with an 
expression vector as described above under conditions 
to provide for expression by the vector of a coding 
sequence encoding the protein, and recovering the 
10 expressed protein . 

The vectors may be, for example, plasmid, virus 
or phage vectors provided with an origin of 
replication, and optionally a promoter for the 
expression of said nucleotide and optionally a 
15 regulator of the promoter. The vectors may contain 

one or more selectable markers, such as, for example, 
an antibiotic resistance. 

Regulatory elements required for expression 
include promoter sequences to bind RNA polymerase and 
20 to direct an appropriate level of transcription 

initiation and also translation initiation sequences 
for ribosome binding. For example, a bacterial 
expression vector may include a promoter such as the 
lac promoter and for translation initiation the Shine- 
25 Dalgarno sequence and the start codon AUG. Similarly, 
a eukaryotic expression vector may include a 
heterologous or homologous promoter for RNA polymerase 
II, a downstream polyadenylat ion signal, the start 
codon AUG, and a termination codon for detachment of 
30 the ribosome. Such vectors may be obtained 

commercially or be assembled from the sequences 
described by methods well known in the art. 

Nucleic acid molecules according to the invention 
may be inserted into the vectors described in an 
35 antisense orientation in order to provide for the 

production of antisense RNA. Antisense RNA or other 
antisense nucleic acids, including antisense peptide 
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nucleic acid (PNA), may be produced by synthetic 
means . 

In accordance with the present invention, a 
defined nucleic acid includes not only rhe identical 
5 nucleic acid but also any minor base variations 

including in particular, substitutions in cases which 
result in a synonymous codon (a different codon 
specifying the same amino acid residue) due to the 
degenerate code in conservative amino acid 
10 substitutions. The term '^nucleic acid sequence'' also 
includes the complementary sequence to any single 
stranded sequence given regarding base variations. 

The nucleic acid sequences according to the 
invention may be produced using recombinant or 
15 synthetic techniques, such as for example using PGR 
which generally involves making a pair of primers, 
which may be from, approximately 10 to 50 nucleotides 
to a region of the gene which is desired to be cloned, 
bringing the primers into contact with cDNA, or 
genomic DNA from a human cell, performing a polymerase 
chain reaction under conditions which brings about 
amplification of the desired region, isolating the 
am.plified region or fragment and recovering the 
amplified DNA, Generally, such techniques are well 
25 known in the art, such as described in Sambrook et al. 
(Molecular Cloning: a Laboratory Manual, 1989) . 

The nucleic acids according to the invention may 
carry a revealing label. Suitable labels include 
radioisotopes such as ^^P or ^-S, enzyme labels or 
other protein labels such as biotin or fluorescent 
markers. Such labels may be added to the nucleic 
acids or oligonucleotides of the invention and may be 
detected using known techniques per se. 

The protein according to the invention includes 
35 all possible amino acid variants encoded by the 
nucleic acid molecule according to the invention 
including a protein encoded by said molecule and 
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having conservative amino acid changes. Proteins or 
polypeptides according to the invention further 
include variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said proteins or 
polypeptides. In this context, substantial homology 
is regarded as a sequence which has at least 7 0%, 
preferably 80 or 90% and preferably 95% amino acid 
homology with the proteins or polypeptides encoded by 
me nucleic acid molecules according to the invention . 
"r.e orotein according to the invention may be 
recombinant, synthetic or naturally occurring, but is 
r: ref erably recombinant . 

A further aspect of the invention provides a host 
cell or organism, transformed or transfected with an 
expression vector according to the invention. The 
host cell or organism may advantageously be used in a 
method of producing protein , which comprises 
recovering any expressed protein from the host or 
organism transformed or transfected with the 
expression vector , 

According to a further aspect of the invention 
there is also provided a transgenic cell, tissue or 
organism comprising a transgene capable of expressing 
2 5 a protein according to the invention . The term 
"transgene capable of expressing'' as used herein 
encompasses any suitable nucleic acid sequence which 
leads to expression of proteins having the same 
funcrion and/or activity. The transgene, may include, 
30 for example, genomic nucleic acid isolated from human 
ceils or synthetic nucleic acid, including DNA 
integrated into the genome or in an extrachromosomal 
state. Preferably, the transgene comprises the 
nucleic acid sequence encoding the proteins according 
35 to the invention as described herein, or a functional 
fragment of said nucleic acid. A functional fragment 
of said nucleic acid should be taken to mean a 



fragment of the gene comprising said nucleic acid 
coding for the proteins according to the invention or 
a functional equivalent, derivative or a non- 
functional derivative such as a dominant negative 
mutant, or bioprecusor of said proteins. 

The protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecusor of said protein also forms part of the 
present invention. Recombinant proteins may be 
recovered and purified from host cell cultures by 
methods known in the art, including ammonium sulfate 
or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose , 
chromatography, hydrophobic interaction 
chromatography, affinity chromatography, 
hydroxyapatite chromatography and lectin 
chromatography . 

The protein of the present invention may be a 
naturally purified product, or a product of chemical 
synthetic procedures, or produced by recombinant ^ 
techniques from a prokaryotic or eukaryotic host (for 
example, by bacterial yeast, higher plant, insect and 
mammalian cells in culture) . Depending upon the host 
employed in a recombinant production procedure, the 
expressed protein may lack the initiating methionine 
residue as a result of post-translational cleavage. 
Proteins which have been modified in this way are also 
included within the scope of the invention. 

In a still further aspect the invention provides 
an antibody capable of specifically binding to a 
protein according to the invention. Preferably the 
antibody is capable of specifically binding to a 
protein comprising the sequence of amino acids set 
forth in SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6. 
An antibody according to the invention may be raised 
according to standard techniques well known to those 
skilled in the art by using the protein of the 
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invention or a fragment or single epitope thereof as 
the challenging antigen. 

A further aspect of the invention comprises a 
nucleic acid capable of hybridising to the nucleic 
5 acids according to the invention, and preferably 

capable of hybridising to the sequence of nucleotides 
set forth in SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 
56, SEQ ID NO: 57, SEQ ID NO: 61, SEQ ID NO: 62, SEQ 
ID NO: 63 or SEQ ID NO: 64, under high stringency 

10 conditions. Conditions of stringency are well known 
to those skilled in the art. 

Stringency of hybridisation as used herein refers 
to conditions under which polynucleic acids are 
stable. The stability of hybrids is reflected in the 

15 melting temperature (Tm) of the hybrids. Tm can be 
approximated by the formula: 

81 . 5°C+16. 6 (logio[Na"] +0.41 (%G&C) -600/1 

20 wherein 1 is the length of the hybrids in nucleotides. 
Tm decreases approxim.ately by 1-1. 5°C with every 1% 
decrease in sequence homology. 

The nucleic acid capable of hybridising to 
nucleic acid molecules according to the invention will 

2 5 generally be at least 70%, preferably at least 80 or 

90% and more preferably at least 95% homologous to the 
nucleotide sequences according to the invention. 

The present invention also advantageously 
provides oligonucleotides consisting essentially of at 

30 least 10 consecutive nucleotides of a nucleic acid 

according to the invention and preferably from 10 to 
50 consecutive nucleotides of a nucleic acid according 
to the invention, in particular a nucleic acid 
comprising the sequence of nucleotides shown in SEQ ID 

35 NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, 

SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63 or SEQ ID 
NO: 64. These oligonucleotides may, advantageously be 
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used as probes or primers to initiate replication, or 
the like. Oligonucleotides having a defined sequence 
may be produced according to techniques well known in 
the art, such as by recombinant or synthetic means. 
5 They may also be used in diagnostic kits or the like 

for detecting the presence of a nucleic acid according 
to the invention. These tests generally comprise 
contacting the probe with the sample under hybridising 
conditions and detecting for the presence of any 

10 duplex or triplex formation between the probe and any 
nucleic acid in the sample. 

To address the functional role of UNC-5 within 
the cell the inventors used the yeast two hybrid 
method (Fields and Song, Nature 340:245, 1989), a 

15 method well known to molecular biologists, both to 

investigate the ability of UNC-5 to form dimers and to 
search for proteins that interact with the UNC-5 
protein. Using the two hybrid approach the inventors 
were able to demonstrate that UNC-5 is capable of 

20 forming homodimers and identified a number of proteins 
which interact with the intracellular domains of the 
C. elegans unc-5 or human UNC-5 proteins. These newly 
identified protein-protein interactions involving UNC- 
5 may represent important events in cellular 

25 signalling, hence compounds which disrupt these 
interactions may potentially have useful 
pharmacological properties . 

Accordingly, in a further aspect the invention 
provides a method of identifying compounds which are 

30 capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
identified as binding to the said UNC-5 protein, which 
method comprises: 

providing a host cell containing a DNA 

35 construct comprising a reporter gene operatively 

linked to a promoter regulated by a transcription 
factor having a DNA binding domain and an 
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activating domain; 

expressing in said host cell a first hybrid 
DNA sequence encoding a first fusion protein 
comprising an UNC-5 protein or a fragment thereof 
5 fused in- frame to either the DNA binding domain 

or the activating domain of the said 
transcription factor ; 

expressing in said host cell a second hybrid 
DNA sequence encoding a second fusion protein 

10 comprising an interacting protein or a fragment 

thereof fused in-frame to either the DNA binding 
domain or the activating domain of the said 
transcription factor, such that when the first 
fusion protein comprises the activation domain of 

15 the said transcription factor the second fusion 

protein comprises the DNA binding domain of the 
said transcription factor and when the first 
fusion protein comprises the DNA binding domain 
of the transcription factor the second fusion 

20 protein comprises the activation domains- 

contacting the host cell with a sample of 
the compound under test; and 

detecting any binding of the UNC-5 protein 
or fragment thereof to the interacting protein or 

25 fragment thereof by detecting the production of 

any reporter gene product in the said host cell. 

The method of the invention is based upon the 
standard two hybrid assay well known in the art. 
30 Preferably the host cell is a yeast cell. Protocols 

for performing a yeast two hybrid assay are well known 
in the art and are given in the Examples included 
herein . 

As would be readily apparent to persons skilled 
35 in the art, the assay can be performed in either 
orientation. That is to say, the assay can be 
performed using an UNC-5 protein or a fragment thereof 
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fused to the DNA binding domain of the transcription 
factor and the interacting protein or fragment thereof 
fused to the activation domain of the transcription 
factor or alternatively the assay can be performed 
5 using an UNC-5 protein or a fragment thereof fused to 
the activation domain of the transcription factor and 
the interacting protein or fragment thereof fused to 
the DNA binding domain of the transcription factor. 

The above-described method based on the classical 

10 yeast two hybrid system can be used to screen for 
compounds that inhibit or enhance the interaction 
between two proteins. In addition, other systems have 
been developed to screen for dissociation events, 
these methods are designated reverse hybrid methods. 

15 These syste^is make use of yeast strains in which the 
expression cf interacting hybrid proteins increases 
the expression of a counter-selectable marker that is 
toxic under particular conditions. Under these 
conditions, dissociation of an interaction provides a 

20 selective advantage, thereby facilitating detection: A 
few growing yeast colonies in which hybrids fail to 
interact can be identified among millions of 
non-growing colonies expressing interacting proteins- 
Several reverse hybrid systems are known in the art. 

25 The first reverse two-hybrid system utilizes a 

yeast strain, which is resistant to cycloheximide due 
to the presence of a mutant CYH2 gene. This strain 
also contains the wild-type CyH2 allele under the 
transcriptional control of the GALl promoter. 

30 Expression of the wild-type GAL4 protein is sufficient 
to restore growth sensitivity to cycloheximide. Growth 
sensitivity towards cycloheximide is also restored by 
the co-expression of the avian c-Rel protein and its 
iKB-a counterpart, p40, as GAL4 fusion proteins. 

35 Restoration of growth sensitivity towards 

cycloheximide requires the association of c-REL and 
p40 at the GALl promoter and correlates with the 
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ability of the c-REL/p40 interaction to activate 
expression from the GALl promoter (Leanna and Hannink, 
1996, NAR 24:3341-3347) 

Another reverse hybrid system makes use of the 
5 most widely used counter-selectable marker in yeast 
genetics, URA3, which encodes orotidine-5' -phosphate 
decarboxylase, an enzyme required for the biosynthesis 
of uracil. Yeast cells that contain wild-type URA3 , 
either on a plasmid or integrated in the genome, grow 

10 on media lacking uracil (URA3+ phenotype) , However, 
the URA3-encoded decarboxylase can also catalyze the 
conversion of a non-toxic analogue, 5-f luoroorit ic 
acid (FOA) into a toxic product, 5-fluoroacil (Boeke 
et al., 1984, Mol. Gen. Genet. 197:345-346). Hence 

15 mutations that prevent an interaction can be selected 
from large libraries of randomly mutated alleles. 
Similarly, molecules that dissociate or prevent an 
interaction could be selected from large libraries of 
peptides or compounds (Vidal et al,, 1996, PNAS 

20 93:10315-10320; Vidal et al., 1996, PNAS 

93: 10321-10326) . 

A third reversed yeast two hybrid is based on the 
use of GAL80 gene as relay gene, GAL80 encodes a 
protein that binds to and masks the activation domain 

25 of a transcriptional activator^ such as GAL4 . The 

reporter genes, which will provide the transcriptional 
read-out (i.e. HIS3 or LACZ), are dependent upon the 
functional GAL4 for expression. Only when the level of 
GAL80 masking protein is reduced by interfering with 

30 the two-hybrid interaction will Gal4 function as a 
transcriptional activator, providing a positive 
transcriptional read-out for molecules that inhibit 
the two-hybrid protein-protein interaction. An 
important feature of this reverse two-hybrid system is 

35 that the basal level and the half-time of the relay 
protein, GAL80, can be fine-tuned to provide maximum 
sensitivity (Powers and Erickson, 1996, WO95/26400). 
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The invention further provides a method of 
identifying compounds which are capable of inhibiting 
or enhancing the binding of an UNC-5 protein to an 
interacting protein previously identified as binding 
5 to the said UNC-5 protein, which method comprises: 

providing a transgenic cell or organism 
expressing a first fusion protein comprising an 
UNC-5 protein or a fragment thereof fused in- 
frame to a first genetically encoded fluorophore 
10 and a second fusion protein comprising an 

interacting protein or a fragment thereof fused 
in-frame to a second genetically encoded 
fluorophore, the first and second fluorophores 
being characterised in that the emission spectrum 
15 of one of the fluorophores overlaps with the 

absorption spectrum of the other fluorophore; 

measuring the amount of fluorescence emitted 
from the fluorophore having an emission spectrum 
which overlaps with the absorption spectrum of 
20 the other fluorophore; 

exposing the transgenic cell or organism, to 
a compound under test; and 

detecting any change in the amount of 
fluorescence emitted fluorescence emitted from 
25 the fluorophore having an emission spectrum which 

overlaps with the absorption spectrum of the 
other fluorophore . 

This method uses fluorescence energy transfer or 
30 FRET, a technique well known in the art for the 

detection and quantitative measurement of a whole 
range of specific binding interactions in biological 
systems, to screen for compounds which modulate the 
binding of UNC-5 or a fragment thereof to an 
35 interacting protein. The general principles of FRET 
are as follows: one component of a binding pair is 
labelled with a first fluorophore (hereinafter 
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referred to as the donor fluorophore) and a second 
component of the binding pair is labelled with a 
second fluorophore (hereinafter referred to as the 
acceptor fluorophore) . 
5 It is an essential feature of the FRET technique 

that the fluorescence emission spectrum of the donor 
fluorophore overlaps with the absorption spectrum of 
the acceptor fluorophore, such that when the two 
components of the binding pair bind to each other, 

10 bringing the donor and acceptor fluorophores into 
close proximity, a proportion of the fluorescent 
signal emitted by the donor fluorophore (following 
irradration with incident radiation of a wavelength 
absorbed by the donor fluorophore) will be absorbed by 

15 the proximal acceptor fluorophore ( a process known in 
the art as fluorescence energy transfer) with the 
result that a proportion of the fluorescent signal 
emitted by the donor fluorophore is quenched and, in 
some instances, that the acceptor fluorophore emits 

20 fluorescence. Fluorescence energy transfer will only 
occur when the donor and acceptor fluorophores are 
brought into close proximity by the specific binding 
reaction. Thus, in the presence of a compound which 
disrupts the specific binding, the amount of quenching 

25 is reduced resulting in an increase in the intensity 
of the fluorescent signal emitted by the donor 
fluorophore or a fall in the intensity of the signal 
emitted by the acceptor fluorophore) . 

The method of the invention is an in vivo FRET 

30 assay because it is performed in a transgenic host 
cell or organism. The transgenic cell can be any 
mammalian cell line, the transgenic organism is 
preferably C. elegans . 

The method of the invention uses genetically 

35 encoded donor and acceptor fluorophores which can be 
expressed as fusion proteins fused in frame to the 
UNC-5 protein and the interacting protein. This can 
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be readily accomplished by transforming or 
transfecting the cell or organism with appropriate 
expression vectors arranged to express the fusion 
proteins . 

In a preferred embodiment the genetically encoded 
donor and acceptor proteins are variant green 
fluorescent proteins which exhibit different 
fluorescent properties and which have suitably 
overlapping emission/absorption spectra, such as EGFP 
{enhanced green fluorescent protein) and EBFP 
■enhanced blue fluorescent protein). As would be 
readily apparent to persons skilled in the art, the 
FP.ET assay can be performed in either orientation. 
That is to say, the assay can be carried out using 
UNC-5 fused to the donor fluorophore and the 
interacting protein fused to the acceptor fluorophore 
or using UNC-5 fused to the acceptor fluorophore and 
the interacting protein fused to the donor 
fluorophore . 

The invention further provides a method of 
identifying compounds which are capable of inhibiting 
or enhancing the binding of an UNC-5 protein to an 
interacting protein previously identified as binding 
to the said UNC-5 protein, which method comprises: 
providing a first reaction component 
comprising a first protein linked to a solid 
support containing a scintillanr and a second 
reaction component comprising a second protein 
which has been radioact i vely labelled, wherein 
the first and second proteins are an UNC-5 
protein or a fragment thereof and an interacting 
protein or a fragment thereof; 

bringing the first and second reaction 
components into contact in an aqueous solution in 
the presence of a compound under test; and 

detecting binding of the first protein to 
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the second protein by detecting light emission 
from the scintillant. 

The above method is based on the scintillation 
5 proximity assay (SPA™) developed by Amersham and 

commonly used in automated high throughput screening. 
In order to perform^ this assay a first interacting 
protein (e.g. an UNC-5 protein) must be linked onto a 
bead containing a scintillant. Linking of the protein 
10 to the beads can be carried out in many different 

ways, including, for example , via biot in-streptavidin 
affinity binding . St reptavidin-SPA beads are 
commercially available from Amersham and the 
interacting protein can easily be biotinylated in 
15 vitro or expressed as a biotinylated fusion protein 

using techniques known in the art . The second 
interacting protein (e.g. a protein known to interact 
with UNC-5) is labelled with radioactivity. This can 
be achieved, for example, by synthesising the second 
20 interacting protein by in vitro translation and 
incorporating a tritiated precursor amino acid. 
The SPA""" assay protocol is then as follows : 

SPA beads linked to the first interacting protein 
are incubated for 30 minutes to one hour with a sample 
25 containing the radioactively labelled second 
interacting protein. Upon binding of the two 
interacting proteins, the radioactivity emitted by the 
labelled protein is brought into close proximity with 
the bead containing scintillant and therefore induces 
30 light emission from the scintillant. The free labelled 
protein in sample (non-bound) will not be held in 
sufficiently close proximity to the beads to induce 
light emission. Compounds which disrupt the binding 
of the first and second interacting proteins will 
35 cause a decrease in the amount of light emitted during 
the experiment. 

As would be readily apparent to persons skilled 
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in the art the assay can be carried out using UNC-5 
linked to the solid support containing scintillant and 
a radioact ively labelled interacting protein or using 
an interacting protein linked to the solid support 
5 containing scintillant and a radioact ively labelled 
UNC-5, 



The invention further provides a method of 
identifying compounds which are capable of inhibiting 
10 or enhancing the binding of an UNC-5 protein to an 

interacting protein previously identified as binding 
to the said UNC-5 protein, which method comprises: 

coating the wells of a microtiter plate with 
UNC-5 protein or a fragment thereof; 
15 contacting the UNC-5 protein or fragment 

thereof with an aqueous solution comprising an 
interacting protein or a fragment thereof, said 
interacting protein being labelled with a tag 
which is directly or indirectly detectable, and a 
20 compound under test; 

washing to remove the compound under test 
and any unbound tagged interacting protein; and 
detecting complexes of UNC-5 or a fragment 
thereof bound to the interacting protein or a 
25 fragment thereof by directly or indirectly 

detecting the presence of the tag. 



This method of the invention uses an ELISA type 
approach to screen for compounds which disrupt binding 

3C between Unc-5 and a protein known to interact with 

UNC-5. In these experiments, the wells of a microtiter 
plate are coated with the UNC-5 protein or fragments 
thereof, A sample containing both the compound under 
test and a protein known to interact with UNC-5 (or a 

35 fragment of the protein which is still capable of 

binding to UNC-5) is then added to the wells and the 
plates are incubated to allow time for specific 
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binding of UNC-5 to the interacting protein. The 
interacting protein (or fragment thereof) is labelled 
with a tag which is directly or indirectly detectable, 
typically a fluorescent molecule such as GFP, or a tag 
which is detectable by specific antibody binding, such 
as a His-tag or GST-tag. Many other tag molecules 
which are equally suitable for this purpose are known 
in the art and are available commercially. The wells 
are then washed to remove the compound and any 
interacting proteins which remain unbound. Any 
interacting protein which has become bound to UNC-5 is 
not removed by the washing step and can be detected 
via the directly or indirectly detectable tag. If the 
interacting protein is labelled with a GFP tag, then 
bound proteins are detected by measuring GFP 
fluorescence; if the interacting protein is labelled 
with a His-tag or a GST tag, bound proteins are 
detected with immunological techniques, using an 
antibody of the appropriate specificity. 

Compounds which disrupt the binding of UNC-5 to 
the interacting protein will result in more of the 
protein remaining unbound, hence less protein will be 
detected after the washing step. 

The invention further provides a method of 
identifying compounds which are capable of inhibiting 
or enhancing the binding of an UNC-5 protein to an 
interacting protein previously identified as binding 
to the said UNC-5 protein, which method comprises: 

exposing a cell or organism expressing UNC-5 

and overexpressing nucleic acid encoding an 

interacting protein to the compound under test; 

and 

screening for reversion of the 
overexpression phenotype of the cell or organism 
to wild-type. 
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10 



Over-expression of genes encoding for proteins 
which interact with UNC-5 in a cell line or in C. 
elegans results in an over-expression phenotype. 

Assays to select for compounds that inhibit the 
interaction of UNC-5 and its interacting proteins can 
therefore be performed in cell lines or C. elegans by 
exposing cells or worms exhibiting an over-expression 
phenotype to the compound under test and screening for 
a 'reduction' of the over-expression phenotype (i.e. 
screening for a reversion to wild-type) . 

Over-expression of proteins which interact with 
unc-5 in C, elegans typically results in neuronal 
outgrowth phenotypes, distal tip cell outgrowth 
phenotypes, and other aberrant outgrowth of various 
15 tissues and cells. These phenotypes can be easily 
monitored by expressing reporter genes, such as 
fluorescent proteins in these cells. Reduction of the 
phenotype induced by the over-expression can then be 
monitored by visual inspection. 
20 Simple assays have been developed to screen for 

compounds which cause reversion of the over-expression 
phenotype in cell lines. As Unc-5 receives signals 
from the netrins, over-expression of proteins which 
interact with unc-5 typically causes phenotypic 
25 changes in neuronal outgrowth and cell movement. 

Accordingly, the step of screening for reduction of 
the over-expression phenotype can be performed using a 
laminin assay, a netrin response assay and assays 
using agarose concentration gradients, a boyden 
30 chamber or stratified layers (see Gundersen, R. W., 

Dev. Biol., 1987, 121(2): 423-431; Klostermann, S. and 
Bonhoeffer, F. , 1996, 4: 237-252). In general, these 
methods are based upon providing attractants or 
repellants for axonal guidance in a controlled manner. 
35 The way the cells react to these attractants and 
repellants forms the basis of the assay. In the 
Boyden chamber (upper and lower chambers separated by 
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a filter barrier) one typically cultivates cells in 
the upper chamber and measures how the cells grow 
through the filter. The agarose approach allows the 
establishment of gradients to which the cells react by 
5 forming specific patterns. 

The above-listed methods are all based upon novel 
interactions between an UNC-5 protein and proteins 
shown to physically interact with the UNC-5 protein. 
10 In preferred embodiments, the UNC-5 protein is a C. 
eiegans UNC-5 protein or a human UNC-5 protein. 
Preferably the human UNC-5 protein is UNC-5C or any of 
the UNC-5C splice variants identified hereinbefore or 
UNC-5HS1 . 

15 The methods of the invention can also be carried 

out using fragments of the UNC-5 protein which retain 
the ability to bind to the interacting protein. 
Preferably the fragment comprises the intracellular 
portion of the protein . Various sub- domains of the 
20 intracellular portion of the protein or combinations 
thereof can also be used. 

As used herein the term ''interacting protein'' 
encompasses any protein which has been demonstrated to 
interact with an UNC-5 protein. The interacting 
25 protein can be a second UNC-5 protein as the examples 
included herein demonstrate the ability of UNC-5 to 
form homodimers. The interacting protein can also be 
a protein identified as interacting with UNC-5 in a 
yeast two hybrid experiment. A list of proteins 
30 identified as interacting with C. eiegans UNC-5 or 

human UNC-5 in a yeast two hybrid experiment is given 
m the Example 4, below. Any of these proteins, or 
fragments thereof which retain a functional UNC-5 
binding site, can be used in the methods of the 
35 invention in combination with the appropriate UNC-5 
protein or a fragment thereof . 

As would be readily apparent to persons skilled 
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in the art, the UNC-5 signalling pathway is highly 
conserved across species. Hence it is to be expected 
that for every interacting protein identified in the 
yeast two hybrid experiments described in the Examples 
given herein a homologous interacting protein will be 
found in other species. For example, for every 
interacting protein found in C. elegans to interact 
with the C. elegans unc-5 protein it is expected that 
a homologous interacting protein will be found in 
humans and will interact with a human UNC-5 protein, 
and vice versa for interacting proteins first 
identified in humans. Accordingly, it is within the 
scope of the invention to perform the methods 
described above with '^homologous combinations" of UNC- 
5 proteins and interacting proteins and even with 
cross-species combinations e,g, elegans unc-5 and a 

human interacting proteins, human UNC-5 and a human 
homologue of an interacting protein identified in C. 
elegans / C. elegans unc-5 and a human homologue of an 
interacting protein identified in C. elegans ; C, 
elegans unc-5 and a human interacting protein etc. 
Lists of homologues of the C, elegans and human 
interacting proteins identified in the yeast two 
hybrid study are given in the Examples included 
herein , 

In a still further aspect the invention provides 
a method of identifying compounds which reduce or 
inhibit the lethal phenotype associated with the 
expression of the UNC-5 death domain in yeast, which 
method comprises: 

exposing a yeast cell containing an 
expression vector comprising nucleic acid 
encoding an UNC-5 protein or a fragment thereof 
comprising the death domain to a compound under 
test ; 
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allowing the yeast cells to grow in the 
presence of the compound; and 

screening for a reduction or inhibition of 
the lethal phenotype associated with the 
5 expression of the UNC-5 death domain in yeast . 



The UNC-5 protein used in the method of the 
invention is preferably a C. elegans UNC-5 protein or 
a human UNC-5 protein . Preferably the human UNC-5 
10 protein is UNC-5C or any of the UNC-5C splice variants 
identified hereinbefore or UNC-5HS1 . 



In a still further aspect the invention provides 
a method of identifying suppressors of the lethal 
15 phenotype associated with the expression of the UNC-5 
death domain in yeast, which method comprises: 

transfecting yeast cells containing an 
expression vector comprising nucleic acid 
encoding an UNC-5 protein or a fragment thereof 
20 comprising the death domain with a cDNA library 

cloned in a yeast expression vector; 

allowing the transfected yeast cells to grow 
for one or more cell divisions; and 

screening for reduction or inhibition of the 
25 lethal phenotype associated with the expression 

of the UNC-5 death domain in yeast. 

Optionally, the method further comprises the 
steps of: 

30 identifying a transfected yeast cell 

exhibiting a reduction or inhibition of the 
lethal phenotype associated with the expression 
of the UNC-5 death domain in yeast; and 

isolating the cDNA clone (s) present in the 

35 transfected yeast cell which is/are responsible 

for conferring reduction or inhibition of the 
lethal phenotype . 
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Again, the UNC-5 protein is preferably a C. 
elegans UNC-5 protein or a human UNC-5 protein. 
Preferably the human UNC-5 protein is UNC-5C or any of 
the UNC-5C splice variants identified hereinbefore or 
5 UNC-5HS1. The cDNA library is preferably a C. elegans 

cDNA library or a human cDNA library. 



The invention will be further understood with 
0 reference to the following experimental examples, 
together with the accompanying Figures in which: 



Figure 1 shows a sequence alignment of the known human 
5 unc-5C cDNA sequence and the three novel alternative 
splice variants of unc-5C. The region of alignment 
corresponds to the portion of the cDNA which encodes 
the intracellular domains of unc-5C. 



Figure 2 shows a multiple alignment of unc-5Hl genes. 
ym97dl2 is an EST clone containing a fragment of the 
unc-5HSl cDNA, 3D is a fragment of the unc-5HSl cDNA 
cloned by PCR in Example 2. 

Figure 3 summarises the cloning of human unc-5C 
variants . 



Figure 4 summarises the cloning of human unc-5HSl. 

Figure 5 is a schematic representation of the human 
unc-5C splice variants. 

Figure 6 shows an alignment between a fragment of the 
protein encoded by the cDNA fragment cloned in pYMP6 
and the rat neurexim Il-alpha-b cDNA. 



Figure 7 shows an alignment between a fragment of the 
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protein encoded by the cDNA fragment cloned in pYMPl? 
and the mouse mena protein. 

Figure 8 is a representation of the vector pGC1037. 
Figure 9 is a representation of the vector pGC1003- 



Examole 1 

1 0 Cloning of the human unc-5C splice variants. 

Splice variants of human unc-5C were cloned, 
primary with RACE technology. 

A 5' RACE was performed using the 5' RACE System 
for Rapid Amplification of cDNA Ends, Version 2.0 
15 {GibcoBRL, Merelbeke, Belgium) , according the 

instructions supplied by the manufacturer or with 
minor modifications thereof. The primers were based on 
the unc-5 EST ym97dl2. 

The first strand cDNA synthesis was performed with 
20 primer: 

GS Pl=oGC7 5 : CGTAGCAGGCACTGGCCTCC 

PGR of dC-tailed cDNA: was performed with the gene- 
specific primer: 

GSP2=oGC76: GCACTGGCCTCCAGCTGGCAGTAG 
25 and the RACE anchor primer supplied with the 5 RACE 
system. 

The PGR Program was: 

Step 1 94"C, 2 min 
Step 2 94°C, 30 sec 
30 Step 3 60°C, 30 sec 

Step 4 72°C, 2 min 
Repeat steps 2 to 4 for 35 cycles 
Step 5 72°C, 7 min 
Step 6 4°C 



35 



A nested PGR was performed with gene-specific primer: 
GSP3=oGC77 : AGTAGAGGTGGGAGGGCGCCTCCTCGCCCAG 
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and 5' RACE anchor primer 



The PGR program was: 

Srep 1 92°C, 2 min 
5 Step 2 92°C, 1 min 

Step 3 68°C, 2 min 
Repeat steps 2 and 3 for 35 cycles 
Step 4 ll^'C, 7 min 
Step 5 4°C 

10 The resulting ElACE products were visualised by 

electrophoresis on agarose gels, the bands excised 
and purified with Jetsorb (Genomed, Germany) . The RACE 
products were ligated into plasmids pAS2 and pGEX-5X-3 
with T4 DNA ligase (Amersham pharmacia biotech, NJ, 

15 USA), or into a TA cloning vector (Invitrogen, 

Groningen, the Netherlands) . Plasmid DNA was purified 
prior to sequencing using the Qiagen plasmid 
purification system (Westburg, Leusden, The 
Netherlands ) . 

20 

Example 2 

Cloning of a new human unc-5 gene. 

Human Brain Poly A+ RNA was obtained from 
Clontech, California, USA and first strand cDNA 
25 synthesis performed with the Ready To Go T-Primed 
First-Strand Kit ( (Amersham pharmacia biotech, NJ, 
USA) . 

Primers were: 
for PCRl; 

30 OGC56 : CCGGAATTCCATATGTTAATACTGCCCTTCTGCTGCTAA 

OGC66 : GCGATCTCTGTAGTTGTGGCCTTG 
PGR program was: 

Step 1 94°C, 1 min 
Step 2 53°C, 30 sec 
35 Step 3 72°C, 2 min 

Repeat steps 1 to 3 40 times 
Step 4 72°C, 7 min 
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Srep 5 4°C 
for PCR2 

OGC63 : GGGAATTCCATATGTTGTTTGTGTATCGGAAGAATCATC 
5 OGC64 : ACGCGTCGACTTAJ\TACTGCCCTTCTGCTGCTAAGGAC 

OGC55 : CCGGAATTCCTTGTTTGTGTATCGGAAGAATCATC 
PGR program was: 

Step 1 94^C, 5 min 

Szep 2 92^C, 30 sec 

10 Step 3 55°C, 30 sec 

Step A 72^C, 2 min 

Repeat steps 2 to 4 for 25 cycles 

Snep 5 72°C, 7 min 

Step 6 4°C 

15 

The resulting PGR products were isolated, cloned 
and analysed as described in Example 1. 

SEQ ID NO: 7 shows the sequence of a PGR product 
isolated using the above PGR strategy. This PGR 

20 product was designated clone 3D. Figure 2 shows an 
alignment between the Rattus nor\/Ggicus unc-5Hl cDNA 
sequence, the sequence of EST ym97dl2, the sequence of 
clone 3D and the sequences of several other PGR 
products amplified using the above PGR strategy (IG, 

25 IJrc and 2Brc) . 



Example 3 

Cloning of two of the fragments of UNC-5 for the 
30 dimerization experiment . 

A PGR amplification was performed with following 
primers : 

UNC5F: GGT GGT CAT ATG GCC ATG GAG TGC TGT AAA CGT GGG 
3 5 AAT TCA AAA AAG 

UNC5R: GGC TGC AGG TCG ACG CCC GGG GGC TTA TGG GGA CAC 
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AAT TTG TGG 

Using the cDNA library used in the yeast two hybrid 
experiment (Example 4) as template. 

FCR program was: 

Step 1 94°C, 1 min 

Step 2 53°C, 30 sec 

Step 3 72''C, 2 min 

Repeat steps 1 to 3 for 25 cycles 

Step 4 72^C, 7 min 

Step 5 4''C 

The resulting PGR products were isolated and 
cloned in frame as Ncol/Sall fragments in the vectors 
pAS2 and pGAD424 supplied by Clontech (Palo Alto, 
California, USA) . 



Example 4 

Vesst two Hybrid Experiments 

To address the functional role of unc-5 the 
inventors used the yeast rwo hybrid method (Fields and 
Song, Nature 340:243, 1989), a method well known to 
molecular biologists, to search for the proceins that 
interact with the UNC-5 protein. 

The two hybrid m.ethod is based on a pair of 
fusion proteins. The first fusion protein comprises a 
first of rwo interacting proteins fused to the 
transcriptional activation domain of a bipartite yeast 
transcription factor; the second fusion protein 
com^prises the second of two interacting proteins fused 
lc the DNA binding domain of the bipartite yeast 
transcription factor. The principle of the method is 
that if the two domains of the bipartite transcription 
factor are physically brought together by binding of 
the first and second interacting proteins then the 
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resulting complex will be able to activate 
transcription from a promoter which contains a target 
binding site for the transcription factor . The two 
hybrid assay is commonly used to study protein-protein 
5 interactions between two known proteins. It can also 
be used to screen a library of proteins to identify 
proteins which interact with a given protein. Both of 
these uses of the two hybrid system are well known to 
those skilled in the art. 

10 In the present invention, the yeast two hybrid 

assay was used to identify proteins which interact 
with C. slogans UNC-5 or human UNC-5 as follows: the 
intracellular part of UNC-5 or parts thereof were 
cloned in fusion with the DNA-binding domain of the 

15 yeast transcription factor GAL4 . A cDNA library was 
cloned into a vector containing the transcriptional 
activation domain of GAL4 . The fusion proteins were 
then independently expressed together in yeast 
containing a reporter gene under the transcriptional 

20 control of a promoter containing GAL 4 binding sites 
(typically GALl lacZ or GAL1-HIS3) . 

Methods 

(A) Construction of the C. eleoans library and 
2 5 standard yeast two hybrid experiments. 

Construction of C. elegans cDNA libraries, and 
yeast two hybrid experiments with C. elegans cDNA were 
performed as described by Elledge et ai., Proc. Natl. 
Acad. Sci., 1991, 88:1731-1735, or using the 
30 Matchmaker™ maker system supplied by Clontech, 

California, USA according to the protocol supplied by 
the manufacturer, or by minor modifications of the 
above-described methods . 

35 

(B) A mating yeast two hybrid experiment. 

Mating yeast two hybrid experiments were 
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performed using plasmid pGC1037 (a plasmid map of 
pGC1037 is shown in Figure 8 and the complete sequence 
of the plasmid is given in SEQ ID NO: 91) as bait, and 
a pre-rransf ormed Human Brain MATCHMAKER cDNA library 
(Clontech, California, USA) according to the protocol 
supplied by the manufacture, or with minor 
modifications thereof . 



In brief summary, the steps of the method are as 
10 follows : - 

Inoculate 1 colony containing the bait plasmid into an 
overnight culture; 

Mate the bait culture and the library culture (24 h) ; 

Plate library mating mixtures; 
15 Incubate for at least 8 days; 

Streak big colonies onto SD-3 + 5mM AT-plates (+/- 

Nylon Membrane); 

Stain yeast on Nylon membrane; 

Prepare yeast DNA from the positives; 
20 Perform restriction digest, if digest is successful 

perform backtransf ormation, using positive and 

negative controls ; 

Transform posirives into MC1061 cells; 
Prepare bacterial DNA using Qiagen Plasmid Mini 
25 Purification kit, according to the standard Qiagen 
protocol; and 
Perform DNA sequencing. 



All positives obtained in the yeast two hybrid 
30 screen were assayed for the specificity of the 

interaction (against empty vector and irrelevant 
proteins) using the two hybrid system. 

(C) Double-stranded RNA inhibit ion-RNAi cloning 
3 5 isolation and injection. 

Double stranded RNA for RNA inhibition 
experiments was prepared according to the MEGAscript 
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protocol (Ambion, UK) . RNA isolated using this 
protocol was purified away from contaminants using the 
RNeasy system from Qiagen (Westburg, the Netherlands), 
following the instructions for RNA clean-up supplied 
5 by the manufacturer. RNA was injected into the 

nematodes using standard procedures (Methods in Cell 
biology. Vol 48, Academic Press, 1995) . 

Results 

^ 0 ( A) Auto-activation and dimerization experiments. 

In a first series of experiments, the ability of 
zhe intracellular domain of C. eJegans unc-5 or parts 
rnereof to dimerize or to cause auto-activation was 
tested. Several plasmids were constructed harboring 

15 rhe intracellular domains of unc-5 and parts thereof. 
Various domains of unc-5, including the membrane 
proximal part (MM?) , the zonula occludens homology 
domain (ZO-1), the unknown part (UP) and the Death 
domain (DD) and were cloned in the vectors pAS2 and 

20 pGAD424 (Matchmaker, Clontech, CA, USA) . The resulting 

vectors are summarized in Table 1. 

Several constructs containing the death domain 
were found to be either toxic or auto-activating. 
Furthermore, by performing homo-dimerizat ion 

25 experiments, it was found that the intracellular 

domain of UNC-5C is capable of forming a homo-dimer. 
Further experiments led to the conclusion that the 
ZO-l/UP region is probably responsible for the homo- 
dimerizat ion . Membrane located signal receptors often 

30 form homo- or hetero-dimer s prior to intracellular 

signal transduction. Accordingly, it is postulated 
that dimer formation in UNC-5 could be a critical 
event in signalling. Based on a knowledge of this 
dimerization it is possible to develop assays to 

35 screen for compounds which disrupt dimer formation and 
to identify unc-5 mutants which are unable to 
dimerize , 
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The present inventors have found that in humans 
UNC-5 proteins may be encoded by at least three genes, 
the homologous genes unc-5C, unc-5HSl, unc-5HS2. As 
UNC-5 is an important receptor involved in a vast 
5 amount of biological processes, it is considered that 
more functional homologous genes or unc-5 genes may 
present in the Homo sapiens genome. In addition, the 
expression of the unc-5 gene does not result in the 
production of a single transcript. The expression of 

10 unc-5C locus can result in the production at least 4 
iscforms as a result of alternative splicing events. 
It is possible that the other unc-5 genes will also 
express splice variants, which may encode different 
protein isoforms. Any of these unc-5 isoforms may form 

15 dimers, analogous to the homo-dimerisat ion found for 
C. elegans unc-5. Accordingly, assays can also be 
developed to screen for chemical substances that alter 
the dimerization of human unc-5 proteins. Compounds 
identified using such an assay may have 

20 pharmacologically useful properties. 

(B) Other receptor dimeri zat ions . 

It has been suggested that, in addition to UNC-6, 
UNC-129 also signals to the UNC-5 receptor (Colavita 

25 et al.. Science 261:706-709). UNC-6 is also known to 
signal to UNC-40{DCC). UNC-129 belongs to the TGF-p 
superfamily, TGF-p receptors, including DAF-1 and 
DAF-4, do not affect axonal guidance. Although new 
TGF-p receptors may be found that are involved in 

30 axonal guidance, it is more likely that the UNC-129 

molecule is able to interact with TSP type I domains, 
which are present in UNC-5. Such interaction between 
TGF-p molecules and TSP Type I domains has been shown 
previously ( Schult z-Cherry et al . , 1994, J. Biol. 

35 Chem. 269, 26775) . Furthermore UNC-129 is also 
involved in the UNC-40 pathway. 

Recent studies have provided support for the idea 
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that the UNC-5 receptor induces switching of UNC-4 0 
from attraction to repulsion (Mehlen et al. , Nature 
395:801-804, 1998). This suggests a linkage of Unc-5 
to oncology since Unc-40 is related to vertebrate DCC 
5 (deleted in colorectal cancer) , which is a candidate 

tumour-suppressor gene, and encodes a receptor for 
netrin-1 (UNC-6) . The reversal from attraction towards 
repulsion in growth cone steering with the two 
receptors UNC-5 and UNC-40 can be explained by 
10 hetero-dimerization between UNC-5 and UNC-40. Such 

switching of function has also been observed in other 
biological processes. The UNC-40/UNC-5 interaction 
may function analogously to the Bax/Bcl-2 interaction 
involved in apoptosis. Bax can be considered as the 
15 protein that protects against apoptosis but the 

relative titre of both Bax and Bcl-2 in a cell may be 
important in the decision of cell death- 
Given that UNC-5 is capable of forming 
homodimers, it is postulated that UNC-5 is also 
20 capable of forming heterodimers with UNC-40. The UNC- 
5/UNC-40 heterodimers may act as a functional receptor 
for UNC-6 and UNC-129. Assays to isolate compounds 
that influence the interaction between UNC-5 and 
UNC-40, both enhancing and inhibiting this interaction 
25 have therefore been developed. These assays are 
analogous to the assays as described to isolate 
compounds that influence the formation of the UNC-5 
dimers and the assays for compounds that influence the 
interaction of UNC-5 with its other interacting 
30 proteins (see below). 

(C) C. eleaans UNC-5 interacting proteins 

The intracellular part of UNC-5 containing the 
domains MPP, ZO-1 and UP cloned in vector pGC1003 (a 
35 plasmid map of pGC1003 is given in Figure 9 and the 
complete sequence of the plasmid is given in SEQ ID 
NO: 92) was used as 'bait' in a yeast two hybrid 
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experiment screening against a C. elegans cDNA 
library. These experiments resulted in the 
identification of ten genes, including three known 
genes and seven genes with heretofore unknown 
5 function, encoding proteins which specifically 
interact with the intracellular part of UNC-5. 
Details of the UNC-5 interacting proteins identified 
during the two hybrid screen are given below. In most 
cases, the results of double-stranded RNA inhibition 
10 experiments (RNAi) designed to inhibit expression of 
the interacting protein are also given. Where 
appropriate, details of human homologues of the 
interacting protein are also given and any known 
disease associations are discussed. 

15 

1) Spect:rin B-chain / Fodrin B-chain (pC1025) 

A first series of hits resulted in the 
identification of plasmid pC1025 which contains a 
fragment of a cDNA encoding the C. elegans spectrin 

20 p-chain/Fodrin . The spectrin p-chain protein is 

encoded by the gene K11C4.3, located on chromosome IV. 

The full length cDNA and amino acid sequences of 
spectrin p-chain/ Fodrin are shown in SEQ ID NOS : 11 
and 12, respectively. The nucleotide sequence of the 

25 fragment of the spectrin p-chain cDNA which is cloned 
as an insert in plasmid pC1025 is given in SEQ ID NO: 
13, the corresponding amino acid sequence is given in 
SEQ ID NO: 14. 

RNAi experiments using a double-stranded RNA 

30 corresponding to the cDNA fragment cloned in pC1025 
revealed that inhibition of the expression of the 
native spectrin p-chain in C, elegans worms causes the 
following phenotype: no embryonal lethality, normal 
canals, normal elongation, growth retardation and 

35 growth arrest at LI and L2, nearly no movement but 
touch reflex is observed. The phenotype is 100% 
penetrant, and the larvea are short and wrinkled. 
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These RNAi phenotypes and the corresponding knock-out 
phenotype can be used as the basis of a compound 
screen in C. elegans to identify chemical substances 
that modulate the activity of the spectrin p-chain 
protein . 

Human Fodrin (genbank accession number 2493434) 
contains an extra C-terminal PDZ domain that is not 
present in spectrin (genbank accession number 134798). 
The human fodrin seems to be more homologous to the C. 
elegans protein. This is in agreement with the finding 
that unc-5 is also expressed in the brain of 
vertebrates . 

The interaction between UNC-5 and fodrin could be 
a critical event in a cell signalling, hence compounds 
which modulate the interaction between UNC-5 and 
fodrin, particularly the interaction between human 
UNC-5 and human fodrin, may potentially have 
pharmacological activity. Assays can also be developed 
0 to screen for genetic mutations that inhibit the 

interaction needed for proper signal transduction. 
Compounds which enhance or inhibit the interaction of 
UNC-5 with fodrin and spectrin p-chain may be useful 
in the development of pharm.aceut ical preparations for 
5 the treatment of Crohn's disease, Sjogren's syndrome, 
secretion related diseases, diseases related to 
neutophil and platelet activation, and long-term 
potential in neurons, Alzheimer's disease, 
proliferative diseases such as carcinomas, neoplasia, 
0 and more specifically, shwannomas, meningiomas, 
ependymonas, squamous cell carcinomas, malignant 
melanomas and lung carcinomas, spherocytosis, 
pyropoikilocytosis, Duchenne muscular dystrophy and 
various neurological disorders. 



2) APR-1 (r>C1028) 

A second plasmid isolated in the yeast two hybrid 
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screen, pC1028, contained a fragment of a cDNA 
encoding APR-1 . 

The nucleotide sequence of the full length APR-1 
cDNA is shown in SEQ ID NO: 15 and the amino acid 
sequence of the APR-1 protein encoded by this cDNA is 
shown in SEQ ID NO: 16. The nucleotide sequence of 
the fragment of the APR-1 cloned in pC1028 is shown in 
SEQ ID NO: 17, with the corresponding amino acid 
sequence shown in SEQ ID NO: 18. 

RNAi experiments using a double-stranded RNA 
corresponding to the fragment cloned in pC1028 
demonstrated that inhibition of APR-1 expression in C, 
elegans results in the following phenotype: more than 
95% embryonic lethality, in 25% of cases this was due 
to the overproduction of pharyngeal tissue and lack of 
endoderm, and premature division of the E daughters 
(Rocheleau at ai.. Cell 90:707-716, 1997). Escapers 
(worms that survive) have abnormal gut cells. These 
RNAi phenotypes and the corresponding knock-out 
phenotype can be used as the basis of a compound 
screen in C, elegans to identify chemical entities 
that modulate the activity of APC (see below) , and 
hence the unc-5 pathway. 

Further yeast two hybrid experiments were 
performed in order to more precisely determine the 
position of the APR binding regions in UNC5, using the 
UNC5 domains MPP, ZO-1, UP and combinations thereof. 
APR-1 seemed to associate with two distinct regions in 
UNC5. First, APR-1 appears to bind to the MPP domain. 
Secondly, APR-1 appears to binding to the ZO-l/UP 
domain. APR-1 seems to bind less to the ZO-1 and UP 
domains when they are present alone and not in 
combination. A similar experiment was carried out 
using the C. elegans UNC-5 protein, and domains of 
human APC and analogous results were obtained. It is 
concluded that APC is capable of binding to two 
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distinct regions of UNC-5, the MPP and the ZO-l/UP 
domains . 

The interaction between UNC-5 and APC/APR-1 could 
be a critical event in cellular signalling and hence 
compounds which modulate this interaction, 
particularly compounds which modulate an interaction 
between human UNC-5 and human APC, may potentially 
have pharmacological activity. Furthermore genetic 
mutations, and splice variants can be identified that 
inhibit the interaction, needed for proper signal 
transduction. Compounds which enhance or inhibit the 
interaction of UNC-5 with APR/APC may be useful in the 
development of pharmaceutical agents for the treatment 
of neurological diseases and colorectal cancers such 
as adenomatous polyposis coli. 



3) UNC-14 (PC1034) 

A third plasmid identified during the yeast two 
hybrid screen using C. elegans UNC-5 as bait (pC1034) 
was found to contain a fragment of the UNC-14 cDNA. 

The nucleotide sequence of the full length UNC-14 
cDNA is shown in SEQ ID NO: 19, the amino acid 
sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 20, The nucleotide sequence of the 
5 fragment of the UNC-14 cDNA cloned as an insert in 
pC1034 is shown in SEQ ID NO: 21, with the 
corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 22, 

C. eiegans worms mutated in unc-14 are observed 
0 to be very sluggish, almost paralysed, small, 

dumpyish, with a tendency to coil and show some egg 
retention. This phenotype can be used as the basis of 
a compound screen in C. elegans to identify chemical 
entities that modulate the activity of UNC-14. 
5 Furthermore, C. elegans worms mutated in the 

unc-14 gene were shown to have abnormal axonal 
elongation and axonal structures. The unc-14 gene 
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10 



15 



35 



encodes a protein of 665 amino acids, and is 
co-expressed with the unc-51 gene in the cell bodies 
and axons of almost all neurons including DD/VD and 
hermaphrodite-specific neurons. The results of yeast 
two-hybrid experiments suggested that a central region 
of UNC-14 binds to the carboxy-terminal region of 
UNC-51, and that the UNC-51 carboxy-terminal region 
oligomerized (Ogura et al . , Genes Dev. 11:1801-1811, 
1997) . 

Mutations in the unc-51 gene, isolated from 
mutants of Caenorhabditis elegans exhibiting abnormal 
axonal extension and growth, encodes a novel 
serinG/threcnme kinase (K. Ogura, et al., 1994, Genes 
Dev. 3 : 2389- 2400) . 



4) FllAlO.l (nGClQ21) 

A fourth plasmid isolated during the yeast two 
hybrid screen, pGC1021, was found to contain a 
fragment of cDNA corresponding to the C. elegans gene 
20 designated FllAlO.l, 

The nucleotide sequence of the full length 
FllAlC.l cDNA is shown in SEQ ID NO: 23, with the 
ammo acid sequence of the protein encoded by this 
CDNA shown rn SEQ ID NO: 24. The nucleotide sequence 
25 of the fragment of the FllAlO.l cDNA cloned in pGC1021 
is shown in SEQ ID NO: 25, the amino acid sequence of 
the protein fragment encoded by this fragment of the 
cDNA is shown in SEQ ID NO: 26. 

To date, no function is as yet known for 
30 FllAlO.l. RNAi experiments using a double-stranded 

RNA corresponding to the insert of pGC1021 showed that 
inhibition of FllAlO.l expression in C. elegans 
results in worms which are weakly constipated. In C. 
elegans, constipation has been associated with 
neuronal dysfunction (Thomas, Genetics 12 4: 855-872, 
1990) . Furthermore and remarkably inhibition of 
FllAlO.l expression causes migration defects in the 
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distal tip cell, similar to those observed in unc-5 
mutants and unc-14 /unc-51 double mutants. These RNAi 
phenotypes and the corresponding knock-out phenotypes 
can be used as the basis of a compound screen in C, 
5 elegans to identify chemical entities that modulate 
the activity of FllAlO.l. 

The interaction between UNC-5 and FllAlO.l could 
be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
10 potentially have pharmacological activity. 

Furthermore, genetic mutations, and splice variants 
can be identified that inhibit the interaction, needed 
for proper signal transduction. Compounds which 
enhance or inhibit the interaction of UNC-5 with 
15 FllAlO.l may be of use in the development of 

pharmaceutical compositions useful in the treatment of 
neurological disorders, tumours such as Kaposi's 
Sarcoma, immunological disorders and diseases related 
to vesicle fusion, proteolysis, peroxisomal and 
20 mitochondrial biogenesis, and transcription. 

5) C15E6 .1/2 (pGC1026> 

A fifth plasmid identified during the yeast two 
hybrid experiment, pGC1026, was found to contain a 
25 fragment of a cDNA encoding the C15E6.1 protein. 

The nucleotide sequence of the full length 
C15E6.1/2 cDNA is shown in SEQ ID NO: 27, with the 
amino acid sequence of the protein encoded by this 
cDNA shown in SEQ ID NO: 28. The nucleotide sequence 
30 of the fragment of the C15E6.1/2 cDNA cloned in 

pGC1026 is shown in SEQ ID NO: 29, the amino acid 
sequence of the protein fragment encoded by this 
fragment of the cDNA is shown in SEQ ID NO: 30. 

RNAi experiments using a double-stranded RNA 
35 corresponding to the insert of pGC1026 did not result 
in any clear visual phenorype. 

The identification of C15E6,l/2 as an UNC-5 
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interacting protein indicates that QNC-S might be a 
band 4 . 1 binding protein and may share homology with 
otner band 4.1 binding proteins such as CD44, 
glycophrin C, and paranodin. 

By using the band 4,1 signature ro search a 
database of C.elegans genes, F07A11.1 on chromosome II 
was identified as encoding a band 4.1 protein. 

The interaction between UNC-5 and C15E6,l/2 could 
be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
potentially have pharmacological activity. Furthermore 
genetic mutations, and splice variants can be 
identified that inhibit the interaction, needed for 
proper signal transduction. Compounds which enhance 
or inhibit the interaction of UNC-5 with C15E6.1/2 may 
be useful in the development of pharmaceutical 
preparations for the treatment of diseases related to 
axonal signalling, synaptic vesicle exocytosis, cell 
adhesion, cytoskeleton associated proteins, cell 
morphology , cell growth, allergic inflammatory 
processes and rheumatoid arthritis. 



6) D1081 . 7 (r>GC1027) 

A sixth plasmid identified during the two hybrid 
screen was found to contain a fragment of cDNA 
corresponding to the C. elegans gene designated 
D1081, 7 . 

The nucleotide sequence of the full length 
D1081.7 cDNA is shown in SEQ ID NO: 31, the amino acid 
sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 32. The nucleotide sequence of the 
fragment of the D1081.7 cDNA cloned as an insert in 
pGC1027 is shown in SEQ ID NO: 33, with the 
corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 34. 

RNAi experiments performed using double stranded 
RKA corresponding to the insert in pGC1027 appeared 
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not to result in any clear visual phenotype. 

All genes so far found in C. elegans have human 
homoiogues. It is therefore expected that D1081.7 will 
also have vertebrate , including human, homoiogues . 
5 These homoiogues can be cloned using standard 
technologies . 

The interaction between UNC-5 and D1081.1 could 
be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
10 potentially have pharmacological activity and thus be 
of use in the development of pharmaceutical 
compositions. Furthermore genetic mutations, and 
splice variants may be identified that inhibit the 
interaction needed for proper signal transduction . 

15 

7) B0238.9 (PGC1032) 

A seventh plasmid identified during the two 
hybrid screen was found to contain a fragment of cDNA 
corresponding to the C. elegans gene designated 
20 B0238.9. 

The nucleotide sequence of the full length 
B0238.9 cDNA is shown in SEQ ID NO: 35, the amino acid 
sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 36. The nucleotide sequence of the 
25 fragment of the B0238,9 cDNA cloned as an insert in 
pGC1032 is shown in SEQ ID NO: 37, with the 
corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 38. 

B0238.9 is located in the chromosomal region 
30 where seu-2 is also located. The seu-2 was identified 
in suppressor screens of ectopically expressed unc-5 
and is considered to be involved in the unc-5 pathway 
(Colavita and Culotti, Dev. Biol. 194:72-85, 1998). As 
a gene has now been isolated that interacts with 
35 unc-5, it is high probable that B0238.9 is the same as 
seu-2. Mutations in seu-2 appeared not to have any 
visual phenotype , as was also observed in RNAi 
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experiments using a double stranded RNA corresponding 
to a fragment of B0238,9. The finding that SEU-2 is a 
suppressor and a binding partner to UNC-5 validates 
the importance of this interaction. Other known 
5 suppressors of ectopic unc-5 growth cone steering are 
unc-6, unc-40, unc-34, unc-44, unc~129, seu-2, 
and seu-3. Mutations in some of these genes show 
axonal guidance defects, unlike seu-2- 

Homology searches in the EST database with 
10 B0238-9 revealed the presence of at least two human 
ESTs with significant homology. The ESTs so found, 
nz77b06 and yu53g01, can be used as basis to clone the 
full length cDNA encoding the human homologue of 
B0238,9. 

15 The interaction between UNC-5 and B0238.9 could 

be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
potentially have pharmacological activity and thus may 
be useful in the development of pharmaceutical 

20 compositions. Furthermore genetic mutations, and 

splice variants may be identified that inhibit the 
interaction, needed for proper signal transduction. 

8) ZC404.8 (PGC1033) 

25 An eighth plasmid identified during the two 

hybrid screen was found to contain a fragment of a 
cDNA corresponding to the C. elegans gene designated 
ZC404 .8. 

The nucleotide sequence of the full length 
30 ZC404.8 cDNA is shown in SEQ ID NO: 39, the amino acid 

sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 40. The nucleotide sequence of the 
fragment of the ZC404.8 cDNA cloned as an insert in 
PGC1033 is shown in SEQ ID NO: 41, with the 
35 corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 42. 

RNAi experiments using a double stranded RNA 
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corresponding to a fragment of this gene resulted in 
an embryonic lethal phenotype , The worms showed no 
elongation and only very little muscle activity, the 
hypodermis is clearly abnormal. 
5 Homology searches in the EST database with 

ZC404,8.9 revealed the presence of at least three 
numan ESTs with significant homology. The ESTs thus 
identified, qe69h03, zx61d04, and zdBSelO;. can be used 
~s basis to clone the full length cDNAs . 

IC The interaction between UNC-5 and ZC404.8 could 

re a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
rctentiaily have pharmacological activity and thus be 
useful in the development of pharmaceutical 

15 preparations. Furthermore genetic mutations, and 

splice variants may be identified that inhibit the 
interaction, needed for proper signal transduction. 

9) vkl7a3 (pGC1023) 

20 A ninth plasmid identified during the yeast two 

hybrid experiment was found to contain a fragment of 
cDWA corresponding to the C. elegans gene designated 
ykl7a3. 

The nucleotide sequence of the fragment of the 
2 5 y kl 7a3 cDNA cloned as an insert in pGC102 3 is shown in 
SEQ ID NO: 43, with the corresponding amino acid 
sequence of the polypeptide encoded by this fragment 
shown in SEQ ID NO: 44. 

RNAi experiments using a double stranded RNA 
30 corresponding to a fragment of ykl7a3 resulted in the 
following phenotypes in C. elegans: Very slow growth, 
and the larvae get typical darker spots as they get 
older. Inhibition of ykl7a3 expression in some non 
wild- type genetic backgrounds leads to defective 
35 moulting, where the worm cannot escape from the old 
cuticle and therefore shrinks and stays in the L4 
stage . The defective moulting phenotype is also 
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observed when ykl7a3 expression is inhibited on a 
wild-type genetic background, although the phenotype 
is less prominent. Worms which escape the defective 
moulting phenotype show defects in vulva development, 
either lacking a vulva altogether or having a vulva 
which is non-functional. 

Homology searches in the Genbank database with 
ykl7a3 revealed the presence of at least one human 
homologue of this gene, designated KIAA0187. 

The interaction between UNC-5 and ykl7a3 
(KIAA0187) could be a critical event in cellular 
signalling and hence compounds which modulate this 
interaction may potentially have pharmacological 
activity. Furthermore genetic mutations, and splice 
variants may be identified that inhibit the 
interaction, needed for proper signal transduction. 
Compounds which enhance or inhibit the interaction of 
UNC-5 with ykl7a3 may be of use in the development of 
pharmaceutical compositions for the treatment of 
CADASIL, artheriohepatic dysplasia, Alzheimer's 
disease, neoplasia such as T-cell acute lymphoblastic 
leukemia and certain cancers, such as pancreatic 
cancer and colon cancer. 

10) F41H10.3 (PGC1020) 

A tenth plasmid identified using the yeast two 
hybrid experiment was found to contain a fragment of a 
cDNA corresponding to the C. elegans gene designated 
F41H10, 3. 

The nucleotide sequence of the full length 
F41H10.3 cDNA is shown in SEQ ID NO: 45, the amino 
acid sequence of the protein encoded by this cDNA is 
given in SEQ ID NO: 46. The nucleotide sequence of 
the fragment of the F41H10.3 cDNA cloned as an insert 
in PGC1020 is shown in SEQ ID NO: 47, with the 
corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 48. 
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r41H10.3 harbors a ATP/GTP binding domain. 

Worms resulting from RNAi experiments using a 
double stranded RNA corresponding to a fragment of 
F41H10.3 did not exhibit a clear visual phenotype. 
5 All genes so far found in C. eJegans have human 

homologues. It is therefore expected that F41H10.3 
will also have vertebrate, including human, 
homologues. These homologues can be cloned using 
standard technologies well known to persons skilled in 

10 the art. 

The interaction between UNC-5 and F41H10.3 could 
be a critical event in signalling and compounds which 
modulate this interaction may potentially have 
pharmacological activity and thus be useful in the 

15 development of pharmaceutical preparations. 

Furthermore genetic mutations, and splice variants may 
be identified that inhibit the interaction, needed for 
proper signal transduction. 

2 0 (Di Human UNC-5 interacting proteins. 

The intracellular part of the human UNC-5 protein 
(human UNC-5HS1) containing the domains ZO-1, UP and 
DD cloned in vector pGC1037 (see above) was used as 
^bait' in a yeast two hybrid experiment screening 

25 against a pretransf ormed human brain Matchmaker cDNA 
library (Clontech, Palo Alto, California USA) using 
the mating screen approach described above. These 
experiments resulted in the identification of six 
genes encoding proteins which interact with UNC-5, 

30 including two known genes and four heretofore unknown 
genes . 

All proteins found in this yeast two hybrid 
screen with the human UNC-5 were different to the 
proteins found in the screen with the C. elegans 
35 UNC-5- There are at least two reasons for this 

variation in the isolated proteins. First, the screens 
are not saturated, which means that not all possible 
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interacting proteins have been isolated, neither in 
the screen with the C. elegans UNC-5 nor in the screen 
winh the human UNC-5. Secondly, different 
intracellular fragments have been used in the screens. 
5 In the C. elegans UNC-5 screen, the intracellular 

domains MPP, ZO-1 and UP were used as bait, whereas in 
the human UNC-5 screen, the intracellular domains 
ZO-1, UP and DD were used as bait. Proteins with 
specific interaction patterns will not be isolated if 

10 the necessary interacting domain is missing, or if the 
optimal combination of domains is missing. This has 
been shown in the C. elegans UNC-5 interaction with 
APR. APR interacts clearly with the MPP domain and the 
domain combination ZO-1, UP, but interacts less 

15 efficiently to with domain combination MPP, ZO-1, 
although the MPP domain is present. APR binds 
efficiently to the domain combination MPP, ZO-1, UP. 

The human UNC-5 interacting proteins identified 
during the two hybrid screen are listed below. In 

20 each case, any known disease associations are 

discussed and genes/cDNAs encoding homologous C. 
elegans proteins are listed. 



1) i-beta-1 . 3-N-acet:vlaininyltans£erase (pYMPS) . 

25 A first plasmid identified during the yeast two 

hybrid experiment was found to contain a fragment of 
the cDNA encoding i-beta-1 , 3-N-acetylaminyltansf erase 

The nucleotide sequence of the full length 
i-beta-1, 3-N-acetylaminyltansf erase cDNA is shown in 

30 SEQ ID NO: 49, the amino acid sequence of the protein 
encoded by this cDNA is given in SEQ ID NO: 50. The 
partial nucleotide sequences of the fragment of the 
i-beta-1 , 3-N-acetylaminyltansf erase cDNA cloned as an 
insert in pYMPS are shown in SEQ ID NOs: 51 and 52, 

35 with the corresponding amino acid sequence of the 

polypeptide encoded by these partial sequences shown 
in SEQ ID NO: 53. 
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C. elegans has at least seven putative homologues 
of i-beta-1, 3-N-acet yiaminy 1 tans f erase , designated 
F2 2F7,6, C18G1.3, K0 9C8.4, F21H7.10, C5 4C8.2, F55H6.6 
and T15D6.4. cDNA and/or amino acid sequences for 
5 each of these putative homologues are given herein. 
Amino acid and nucleotide sequences for these 
homologues are given in SEQ ID NOS : 66 to 82. 

The interaction between UNC-5 and 
beta-1, 3-N-acetylglucosaminyltransf erase could be a 
0 critical event in signalling and hence compounds which 
modulate this interaction may potentially have 
pharmacological activity. Furthermore genetic 
mutations, and splice variants may be identified that 
inhibit the interaction, needed for proper signal 

15 transduction. Compounds which modulate the interaction 
of UNC-5 with beta-1, 3-N-acetylglucosaminyltransf erase 
may be useful in the development of pharmaceutical 
preparations for the treatment of synaptic cleft 
dysfunctions, vesicle transport dysfunctions, 

20 inflamation, various tumours and more particular in 

tumour cell adhesion, migration and invasion, such as 
pancreas cancer, squamous cell cancer, human breast 
cancer, thyroid neoplasms, colorectal carcinomas. 

2 5 2) new gene wi1::h slight: homology "bo neurexin 
Il-alpha-b (NHIX) (pYMP6) 

A second plasmid identified during the yeast two 
hybrid experiment Was found to contain a fragment of a 
cDNA corresponding to a new gene with slight homology 
30 to neurexin Il-alpha-b . The new gene was designated 
NHII, 

Partial nucleotide sequences for the fragment of 
cDNA cloned as an insert in pYMP6 are shown in SEQ ID 
NO: 54 (coding strand sequenced from one end of the 
35 insert of pYMP6 sequenced with forward primer) and and 
SEQ ID NO: 55 (non-coding strand sequenced from one 
end of pYMP6 with reverse primer) . The plasmid pYMP6 
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was deposited in the Belgian Co-ordinated Collections 
of Microorganisms (BCCM) , Universiteit Gent, K. L. 
Ledeganckstraat 35, B-9000, Gent, Belgium on 21 May 
1999 under accession number LMBP 3932. The cDNA 
5 insert (approximately ISOObp) can easily be excised 
from this plasmid by digestion with the restriction 
enzymes EcoRI and Xhol. Alternatively the cDNA insert 
sequence can be amplified by PCR using primers 
corresponding to the sequences for the ends of the 

10 insert given in SEQ ID NOS : 54 and 55. 

The interaction between UNC-5 and the new gene 
with homology to neurexin Il-alpha-b could be a 
critical event in signalling and hence compounds which 
modulate this interaction may potentially have 

15 pharmacological activity. 

3) New Gene with Mena homolocn/ (MHI) (pYMP17) 

A third plasmid identified during the yeast two 
hybrid experiment was found to contain a fragment of a 
20 cDNA encoding a protein sharing slight homology with 
the human mena protein. The new gene was designated 
MHI . 

Partial nucleotide sequences of the fragment of 
cDNA cloned as an insert in pYMP17 are shown in SEQ ID 

25 NC: 56, (coding strand sequenced from one end of the 
insert of pYMP sequenced with forward primer) and SEQ 
ID NO: 57 (non-coding strand sequenced from one end of 
pYMP with reverse primer) . An alignment between the 
amino acid sequence encoded by the insert of pYMP17 

30 and the mouse mena protein is shown in Figure 7. The 
plasmid pYiyiP17 was deposited in the Belgian Co- 
ordinated Collections of Microorganisms (BCCM) , 
Universiteit Gent, K. L. Ledeganckstraat 35, B-9000, 
Gent, Belgium on 21 May 1999 under accession number 

35 LM5P 3935. The cDNA insert (approximately lOOObp) can 
easily be excised from this plasmid by digestion with 
the restriction enzymes EcoRl and Xhol. Alternatively 
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the cDNA insert sequence can be amplified by PGR using 
primers corresponding to the sequences for the ends of 
the insert given in Figures 55A and 55B. 

C. elegans has at least one protein with homology 
5 to the new Mena homologue (MHl), encoded by the gene 
designated Y50D4 . Contig200 . The C. elegans gene, 
unc-34 (which maps with Y50D4) is known to suppress 
the axonal guidance defects induced by ectopic 
expression of the Netrin receptor UNC-5 (Colavita, A. 

10 et al.. Dev. Biol., 194:72-85, 1998.)- 

The interaction between UNC-5 and mena, members 
of this mena superfamily, unc-34, and Y50D4 . contig200 , 
could be a critical event in signalling and hence 
compounds which modulate these interactions may 

15 potentially have pharmacological activity and thus may 
be useful in the development of pharmaceutical 
compositions . 

4) Alpha-2 macroglobulin (pYMP30) 

20 A fourth plasmid identified during the yeast two 

hybrid experiment was found to contain a fragment of 
the human alpha-2 macroglobulin cDNA. 

The nucleotide sequence of the full length alpha- 
2 macroglobulin cDNA is shown in SEQ ID NO: 58, the 

25 amino acid sequence of the protein encoded by this 

cDNA is given in SEQ ID NO: 59. A partial nucleotide 
sequence for the fragment of the alpha-2 macroglobulin 
cDNA cloned as an insert in pYMP30 is shown in SEQ ID 
NO: 60. 

30 C. elegans has at least one homologue of alpha-2 

macroglobulin, designated ZK337.1, of which two splice 
variants designated ZK337.1a and ZK337.1b are known to 
exist , 

The interaction between UNC-5 and alpha-2 
35 macroglobulin could be a critical event in signalling 
and hence compounds which modulate this interaction 
may potentially have pharmacological activity. 
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Compounds which enhance or inhibit the interaction of 
UNC-5 with alpha-2 macroglobulin could be useful in 
rhe development of pharmaceutical substances. 

5 5) New gene 1 (pYMPll) 

A fifth plasmid identified during the yeast two 
hybrid experiment was found to contain a fragment of 
cDNA with no homology to any known human cDNA. 

Partial nucleotide sequences for the fragment of 

10 cDNA cloned as an insert in pYMPll are shown in SEQ ID 

NO: 61 (ceding strand sequenced from one end of the 
insert of pYMF sequenced with forward primer) and SEQ 
ID NO: 62 (non-coding strand sequenced from one end of 
pYMP with reverse primer) . The plasmid pYMPll was 

15 deposited m the Belgian Co-ordinated Collections of 
Microorganisms (BCCM) , Universiteit Gent, K. L, 
Ledeganckstraat 35, B-9000, Gent, Belgium on 21 May 
1999 under accession number LMBP 3933. The cDNA 
insert (approximately 2300bp) can easily be excised 

20 from this plasmid by digestion with the restriction 

enzymes EcoRI and Xhol. Alternatively the cDNA insert 
sequence can be amplified by PGR using primers 
corresponding to the sequences for the ends of the 
insert given in Figures 59A and 59B, 

25 The interaction between UNC-5 and the protein 

encoded by the insert of pYMPll could be a critical 
event m signalling and hence compounds which modulate 
this interaction may potentially have pharmacological 
activity and thus could be useful in the development 

30 of pharmaceutical substances. 



6) New gene 2 (pYMP12) 

A sixth plasmid identified during the yeast two 
hybrid experiment was found to contain a fragment of 
cDNA with no homology to any known human cDNA. 

Partial nucleotide sequences for the fragment of 
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cDNA cloned as an insert in pYMP12 are shown in SEQ ID 
NO: 63 (coding strand sequenced from one end of the 
insert of pYMP sequenced with forward primer) and SEQ 
ID NO: 64 (non-coding strand sequenced from one end of 

5 pYMP with reverse primer). The plasmid pYiyiP12 was 

deposited in the Belgian Co-ordinated Collections of 
Microorganisms (BCCM) , Universiteit Gent, K, L. 
Ledeganckstraat 35 , B-9000 , Gent , Belgium on 21 May 
1999 under accession number LMBP 3934, The cDNA 

0 insert (approximately 2000bp) can easily be excised 
from this plasmid by digestion with the restriction 
enzymes EcoRI and Xhol. Alternatively the cDNA insert 
sequence can be amplified by PCR using primers 
corresponding to the sequences for the ends of the 

5 insert given in Figures 6 OA and 60B . 

The interaction between UNC-5 and the protein 
encoded by the insert of pYMP12 could be a critical 
event in signalling and hence compounds which modulate 
this interaction may potentially have pharmacological 

0 activity and thus could be useful in the development 
of pharmaceutical substances . 

Example 5 

Yeast two hybrid compound screens 

5 Interactions of proteins leads to expression of a 

reporter protein p-galactosidase in a yeast two hybrid 
assay. An assay has been developed that is usable in 
96 or 384 well plates or microtiter plates with 
another number of wells. This assay is suitable for 

>0 high throughput compound screening. Optimal 

performance of the assay is dependent upon at least 
two important parameters : 

lysis of yeast cells and the choice of the 
(5-galactosidase substrate . 
J 5 The basic protocol for an assay in 96 or 384 we 11 

plates is as follows : 

A yeast strain containing the Escherichia coli 
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10 



lacZ gene under the control of the yeast Gal4 promoter 
IS grown overnight (with shaking at 230-270 rpm) then 
diluted with YPD medium to an OD600 of 0.2. Diluted 
cultures are grown for an additional 3-5 hr until 
mid-log phase. Yeast cells are then transferred to 
either 96- or 384-well plates (100 pl/well or 25 
pi/well, respectively) . Alternatively, cells can be 
cultured in the microtiter plates, eliminating the 
need for a pipetting step. 

The yeast cells are then either lysed by freeze 
and thaw method (liquid to freeze, 37°C water bath 
to thaw) or by use of a Lysis buffer (e.g.: 1% Lithium 
dodecyl sulphate, 100 mM EDTA and 10 mM Tris-HCl pH 
8.0). Non-lysed cells also give a signal, although the 
15 variability is increased if the cells are not lysed. 
Yeast cells can also be permeabili zed with various 
reagents such as isopropanol (15 %). 

The substrate sensitivity must be optimised for 
efficient detection in a screening process. 
20 Fluorescein di galactoside (FDG) is a typical low cost 
fluorescent reagent for the detection of 
p-galactosidase; it can be used for screening, 
although autof luorescent compounds can induce a non- 
desirable background leading to false positives. 
25 Alternative substrates are available that become 

luminescent upon p-galactosidase cleavage, thereby 
eliminating background problems. An example of such a 
substrate Galacton-Star® from Tropix. Typically about 
l\iM substrate is added and the plates are incubated at 
30 room temperature for 60 minutes. Fluorescence (for 
FDG) is then measured at 530 nm.. It is typically 
possible to detect as low as 100 cells per well. 

As an alternative to the use of p-galactosidase, 
secreted alkaline phosphatase can be used as a 
35 reporter gene. The use of secreted alkaline 
phosphatase gives equivalent sensitivity to 
galactosidase with the advantage that there is no need 
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to lyse the cells. Fluorescent substrates for alkaline 
phosphatase are available commercially from Sigma- 
Aldrich (Bornem, Belgium) or Molecular Probes (Eugene, 
OR, USA) . 

The test compound can be added at various stages 
of the above procedure. Generally, the compound is 
added on the plates onto which the yeast are plated. 
However, the compound can also be added during the 
second incubation in order to overcome toxicity 
ptoblems . As a control, it is important to check 
whether the compound slows down the growth of the 
yeast. This can be done using turbidity measurements. 

Example 6 

Detection of in i/ivo protein-protein interactions 
using fluorescence energy transfer (FRET). 

An in vivo FRET assay can be conveniently 
performed using two different mutants of GFP which 
absorb and emit light at different wavelengths and 
which have suitably overlapping emission/absorption 
spectra, such as EGFP (enhanced green fluorescent 
protein) and EBFP (enhanced blue fluorescent protein) . 
When rwo such variant GFPs are brought into close 
proximity, within a few nanometers distance, 
fluorescence energy transfer (FRET) can be detected. 
Such transfer is characterized by a reduction of 
fluorescence intensity of the donor fluorophore (EBFP) 
and re-emission of fluorescence at the acceptor 
fluorophore (EGFP) wavelengths . Therefore if each 
fluorophore is fused to a protein domain known to bind 
to the other the protein-protein interaction can be 
monitored in vivo using FRET . 

In a typical example, the APC binding domain of 
UNC-5 cloned in fusion with EBFP, whereas APC is 
cloned in fusion with EGFP in expression vectors 
suitable for use in the chosen host cell line or 
organism. When both fusion proteins are expressed in 
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a cell line or in C. elegans it is possible to monitor 
and quantify their in vivo interaction by irradiating 
Lhe cells/worms with light at 488nm. When the donor 
and acceptor fluorophore are brought into close 
proximity by binding of the two fusion proteins 
fluorescent energy transfer results in a measurable 
oecreased in fluorescence from the fluorescence donor 
at a wavelength within the emission spectrum of the 
Conor. In simple terms, what is measured is a 

1 'L' quenching phenomenon since light emitted by the donor 

rluorophore is trapped by the acceptor fluorophore. 
:.*E:- The experiment could also be performed by 
r^-oasuring fluorescence from the acceptor fluorophore 
but this is often less sensitive. 

15 Plasmid vectors containing both EGFP and EBFP are 

conamercially available from Clontech (Palo Alto, 
California, USA) . Information on the use of these 
vectors is also supplied by the manufacturer. 

20 

F:x ample 7 

Genetic and complementation screens in veast . 

UNC-5 expression in yeast cells results in a 
lethal phenotype, mainly because of the expression of 

25 a death domain. This observation was most clearly seen 
in the experiments with C. elegans UNC-5. Accordingly, 
assays can be developed to screen for compounds, 
interacting proteins and suppressors which alter the 
acrivity of UNC-5, particularly the activity of the 

30 death domain of UNC-5. These assays are analogous to 
those described by Xu and Reed (Mol, Cell 1998, 1:337- 
46) . 

(A) Compound screens. 
35 Yeast cells are transfected with a plasmid 

encoding the C. elegans or human unc-5 (including the 
death domain) , such as the vectors described in the 
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yeast two hybrid experiments. The transfected yeast 
cells are then placed in the wells of micotiter 
plates, and are exposed to the compounds under test. 
Compounds which reduce or inhibit the lethal phenotype 
5 of the yeast cells transfected with unc-5 are scored 
as hits. Such compounds will typically suppress the 
unc-5 lethal phenotype by interacting with UNC-5 
itself, or with UNC-5 interacting proteins , or with 
proteins in the UNC-5 pathway, or with proteins in 
10 parallel pathways. The selected compounds can be used 
in the development of pharmaceutical preparations. 

(B) Suppressor screens . 

Yeast cells are transfected with a plasmid 

15 encoding the C. elegans or human unc-5 (including the 
death domain) , such as the vectors described in the 
yeast two hybrid experiments. Furthermore, the yeast 
cells are transfected with a library expressing C. 
eiegans or human cDNA, such as the libraries described 

20 in the yeast two hybrid experiments. The transfected 

yeast cells are placed in the wells of micotiter 
plates, and allowed to grow further. This allows 
selection cDNAs, and hence genes and proteins, that 
reduce or inhibit the lethal phenotype of the yeast 

25 ceils transfected with the death domain of unc-5. 

Such proteins will interact with UNC-5, or with UNC-5 
interacting proteins, or with proteins in the UNC-5 
pathway, or with proteins in parallel pathways to 
cause suppression of the unc-5 lethal phenotype. 

30 The selected cDNAs genes and proteins can be used in 
the development of pharmaceutical preparations or in 
the development of assays to select for compounds that 
enhance their function or expression. 

35 

Example 8 

Cloning of a C. Gleaans gene starting from a C. 
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eleaans insert. 

If a fragment of a given gene or cDNA is known 
then further fragments of the corresponding full 
length gene and/or cDNA can be constructed can often 
5 be found using in silico techniques such as AceDB (see 
http: Wwww, Sanger , ac. uk) , or searching of the EST 
database. The full cDNA can be cloned using standard 
technology such as 5'/3' RACE or SLl/2 RT-PCR on worm 
total RNA and colony hybridization. An analogous 
10 strategy is followed to clone a full length gene 
and/or cDNA for vertebrate and hence Human DNA. 

Example 9 

Cloning of C. eleaans gene starting from a human 

1 5 insert . 

A full length C. elegans gene can be cloned 
starting from a human sequence. Using in sllico 
techniques, a homologue or an EST can be found. 
Standard molecular biology techniques can then be used 

20 to clone the full length C. elegans gene. If no 

homologous sequence can be found by simple database 
searching, it may be necessary to perform species 
hopping. An analogous strategy is followed to clone a 
full length gene and/or cDNA for vertebrate and hence 

25 Human DNA, starting from a C. elegans DNA sequence. 
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SEQUENCE LISTING 

SEQ ID NO: 1 nucleotide sequence of a part of the 

human unc-5Cb cDNA which encodes the 
intracellular region of the protein. 

SEQ ID NO : 2 amino acid sequence of the 

intracellular part of the human unc-5Cb 
protein encoded by the nucleotide 
1 ■ - sequence shown as SEQ ID NO: 1, 

SE2 ID NO: 3 nucleotide sequence of a part of the 

human unc-5Cc cDNA which encodes the 
intracellular region of the protein, 

SEQ ID NO : 4 amino acid sequence of the 

intracellular part of the human unc-5Cc 
protein encoded by the nucleotide 
sequence shown as SEQ ID NO: 3. 

20 

SEQ ID NO : 5 nucleotide sequence of a part of the 

human unc-5C8 cDNA which encodes the 
intracellular region of the protein. 

25 SEQ ID NO: 6 ammo acid sequence of the 

intracellular part of the human unc-5C8 
protein encoded by the nucleotide 
sequence shown as SEQ ID NO: 5. 

30 SEQ ID NO: 7 nucleotide sequence of the fragment of 

the human unc-5Hl cDNA cloned by PGR in 
Example 2. 



SEQ ID NO: 

35 



8 



predicted amino acid sequence for the 
human unc-5Hl protein, translation in 
frame 1 . 
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SEQ ID NO: 9 

5 SEQ ID NO: 10 

SEQ ID NO: 11 

10 

SEQ ID NO: 12 

15 SEQ ID NO: 13 

SEQ ID NO: 14 

20 

SEQ ID NO; 15 

25 

SEQ ID NO: 16 

SEQ ID NO: 17 

30 

SEQ ID NO: 18 

35 

SEQ ID NO: 19 



PCT/EPOO/05108 

- 58 - 

predicted amino acid sequence for the 
human unc-5Hl protein, translation in 
frame 2 . 

predicted amino acid sequence for the 
human unc-5Hl protein, translation in 
frame 3. 

nucleotide sequence of the C. elegans 
spectrin (3-chain/Fodrin cDNA, 

amino acid sequence of the C. elegans 
spectrin p-chain/Fodr in protein. 

nucleotide sequence of the fragment of 
the C. elegans spectrin |3-chain/Fodrin 
cDNA cloned in pC1025. 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO; 13. 

nucleotide sequence of the C. elegans 
APR-1 cDNA. 

amino acid sequence of the C. elegans 
APR-1 protein. 

nucleotide sequence of a fragment of 
the C. elegans APR-1 cDNA cloned in 
pC1028 . 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 17 . 

nucleotide sequence of the C. elegans 
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unc-14 cDNA. 

amino acid sequence of the C. elegans 
unc-14 protein. 

nucleotide sequence of the fragment of 
the C. elegans unc-14 cDNA cloned in 
pC1034 . 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 21. 
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SEQ ID NO: 20 

5 

SEQ ID NO: 21 

10 SEQ ID NO: 22 

SEQ ID NO: 23 

15 

SEQ ID NO: 24 

20 SEQ ID NO: 25 

SEQ ID NO: 26 

25 

SEQ ID NO: 27 

30 

SEQ ID NO: 28 
SEQ ID NO: 29 

35 



nucleotide sequence 
EllAlO.l cDNA. 

amino acid sequence 
EllAlO.l protein. 



nucleotide sequence 
C15E6.1 cDNA. 

amino acid sequence 
C15E6 . 1 protein . 



of the C. elegans 



of the C. elegans 



of the C. elegans 



of the C. elegans 



nucleotide sequence of the fragment of 
the C. elegans C15E6.1 cDNA cloned in 
pGC1026. 



nucleotide sequence of the fragment of 
the C. elegans EllAlO.l cDNA cloned in 
pGC1021 . 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 25. 
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SEQ ID NO: 30 amino acid sequence of the polypeptide 

encoded by the cDNA fragment shown as 
SEQ ID NO: 29. 

5 SEQ ID NO: 31 nucleotide sequence of the C. elegans 

D1081.7 cDNA, 

SEQ ID NO: 32 amino acid sequence of the C. elegans 

D1081.7 protein. 

10 

SEQ ID NO: 33 nucleotide sequence of the fragment of 

the C. elegans 1081.7 cDNA cloned in 
pGC1027 . 

15 SEQ ID NO: 34 amino acid sequence of the polypeptide 

encoded by the cDNA fragment shown as 
SEQ ID NO: 33, 

SEQ ID NO: 35 nucleotide sequence of the C. elegans 
20 B0238.9 cDNA (seu-2). 

SEQ ID NO: 36 amino acid sequence of the C. elegans 

B0238,9 protein (seu-2). 

25 SEQ ID NO: 37 nucleotide sequence of the fragment of 

the C. elegans B0238.9 cDNA cloned in 
PGC1023 . 

SEQ ID NO: 38 amino acid sequence of the polypeptide 
30 encoded by the cDNA fragment shown as 

SEQ ID NO: 37. 

SEQ ID NO: 39 nucleotide sequence of the C. elegans 

ZC404,8 cDNA. 



35 



SEQ ID NO: 40 amino acid sequence of the C. elegans 

ZC404.8 protein. 
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nucleotide sequence of the C. elegans 
ZC404.8 cDNA cloned in pGC1033. 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 41. 

nucleotide sequence of the fragment of 
the C. elegans ykl7a3 cDNA cloned in 
PGC1023. 
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SEQ ID NO: 41 

SEQ ID NO: 42 

5 

SEQ ID NO: 43 

10 

SEQ ID NO: 4 4 

15 

SEQ ID NO: 4 5 

SEQ ID NO: 4 6 

20 

SEQ ID NO: 4 7 

25 

SEQ ID NO: 48 



30 SEQ ID NO: 49 



SEQ ID NO: 50 

35 



amino acid sequence 
encoded by the cDNA 
SEQ ID NO: 43. 



amino acid sequence 
F41H10. 3 protein. 



of the polypeptide 
fragment shown as 



of the C. elegans 



nucleotide sequence of the fragment of 
the C. elegans F4 1H10.3 cDNA cloned in 
pGC1020. 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 47. 

nucleotide sequence of the human i- 
beta-1 , 3-N-acetylaminyltransf erase 
cDNA. 

amino acid sequence of the human i- 
beta-1 , 3-N-acetylaminyltransf erase 
protein . 



nucleotide sequence of the C. elegans 
F41H1G.3 cDNA. 
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SEQ ID NO: 51 

5 

SEQ ID NO: 52 

10 

SEQ ID NO: 5 3 

15 

SEQ ID NO: 54 

20 

SEQ ID NO: 55 

25 

SEQ ID NO: 56 
30 SEQ ID NO: 57 

SEQ ID NO: 58 

35 

SEQ ID NO: 59 
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partial nucleotide sequence for the 
fragment of the human i-beta-1 , 3-N- 
acetylaminyltransf erase cDNA cloned in 
pYMP5 (forward primer, coding srrand). 

partial nucleotide sequence for the 
fragment of the human i-beta- 1 , 3-N- 
acetylaminyltransf erase cDNA cloned in 
pYMP5 (reverse primer, non-coding 
strand) 

partial amino acid sequence for the 
polypeptide encoded by the fragment of 
the i-beta-1 , 3-N- 

acetylaminyltransf erase cDNA cloned in 
pYMP5 . 

partial nucleotide sequence for the 
human cDNA fragment cloned in pYMPS 
(forward primer, coding strand). 

partial nucleotide sequence for the 
human cDNA fragment cloned in pYMP6 
(reverse primer, non-coding strand) . 

partial nucleotide sequence for the 
human cDNA fragment cloned in pYMP17 
(forward primer, coding strand), 

partial nucleotide sequence for the 
human cDNA fragment cloned in pYiy[P17 
(reverse primer, non-coding strand) . 

nucleotide sequence of the human alpha- 
2-macroglobulin cDNA . 

amino acid sequence of the human alpha- 
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SEQ ID NO: 60 

5 

SEQ ID NO: 61 

10 

SEQ ID NO: 62 

15 

SEQ ID NO: 63 
20 SEQ ID NO: 64 

SEQ ID NO: 65 

25 

SEQ ID NO: 66 

30 

SEQ ID NO: 67 
35 SEQ ID NO: 68 
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2-ruacroglobulin protein . 

partial nucleotide sequence for the 
fragment of the human alpha-2- 
macroglobulin cDNA cloned in pYMP30 
(reverse primer, non-coding strand) , 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMPll 
(forward primer, coding strand). 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMPll 
(reverse primer, non-coding strand). 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMP12 
(forward primer, coding strand). 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMP12 
(reverse primer, non-coding strand). 

amino acid sequence of the mouse APC-2 
cDNA. 

nucleotide sequence of a C. elegans I- 
beta-1, 3-N-acetylaminyltransf erase cDNA 
(F22F7. 6) . 

amino acid sequence of a C. elegans 1- 
beta-1 , 3-N-acetylaminyltransf erase 
protein (F22F7 . 6) . 

nucleotide sequence of the C. elegans 
alpha-2-macroglobulin cDNA ZK337 . la . 
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SEQ ID NO: 69 

SEQ ID NO: 70 

5 

SEQ ID NO: 71 

10 SEQ ID NO: 72 

SEQ ID NO: 73 

15 

SEQ ID NO: 74 

20 

SEQ ID NO: 75 

25 

SEQ ID NO: 76 



30 SEQ ID NO: 77 



SEQ ID NO: 78 

35 
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nucleotide sequence of the C. elegans 
alpha-2-macroglobulin cDNA ZK337.1b 

amino acid sequence of the C, elegans 
alpha-2-macroglobulin protein ZK337 . la . 

amino acid sequence of the C, elegans 
alpha-2-macroglobulin protein ZK337 . lb. 

cDNA sequence for the C. elegans I- 
beta-1 , 3-N-acetyiaminyltransf erase 
homologue C18C1.3. 

amino acid sequence for the C. elegans 
I -beta-1 , 3-N-acetylaminyltransf erase 
homologue C18C1 . 3 . 

cDNA sequence for the C. elegans I- 
beta-1 , 3-N-acetylaminyltransf erase 
homologue K09C8.4. 

amino acid sequence for the C. elegans 
I -beta-1 , 3-N-acetylaminyltransf erase 
homologue K09C8.4, 

amino acid sequence for the C. elegans 
I -beta- 1 , 3-N-acetylaminyltransf erase 
homologue F21H7 . 10 . 

cDNA sequence for the C. elegans 1- 
beta-1 , 3-N-acetylaminyltransf erase 
homologue C54C8.2. 

amino acid sequence for the C. elegans 
I -beta-1 , 3-N-acetylaminyltransf erase 
homologue C54C8.2. 
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SEQ ID NO: 79 



5 SEQ ID NO: 80 



SEQ ID NO: 81 

10 

SEQ ID NO: 82 

15 

SEQ ID NO: 83 

20 

SEQ ID NO: 



25 SEQ ID NO: 85 



SEQ ID NO: 86 

30 

SEQ ID NO: 87 

35 

SEQ ID NO: 88 
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cDNA sequence for the C. elegans 1- 
beta-1 , 3 -N-acetylaminyl transferase 
homologue F56H6. 6. 

amino acid sequence for the C. elegans 
I -beta-1, 3-N-acetylaminyl transferase 
homologue F5 6H6 . 6 . 

cDNA sequence for the C. elegans I- 
beta-1 , 3 -N-acetylaminyl transferase 
homologue T15D6 . 4 . 

amino acid sequence for the C. elegans 
I -beta-1, 3 -N-acetyiaminyl transferase 
homologue T15D6,4. 

amino acid sequence of the 
exrracellular part of the C. elegans 
unc-5 protein . 

amino acid sequence of the 
transmembrane region of the C. elegans 
unc-5 protein . 

amino acid sequence of the membrane 
proximal part of the C. elegans unc-5 
protein . 

amino acid sequence of the zonula 
occludens part of the C. elegans unc-5 
protein . 

amino acid sequence of a part of the C, 
elegans unc-5 protein of unknown 
function . 

amino acid sequence of the death domain 



1 
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of the C. elegans unc-5 protein. 

SEQ ID NO: 89 amino acid sequence of the human HSl 

protein . 

SEQ ID NO: 90 amino acid sequence of the human UNC5C 

protein . 

ID NO: 91 complete nucleotide sequence of plasmid 
PGC1037 . 

TE: ID NO: 92 complete nucleotide sequence of plasmid 

pGClOOB. 

SEQ ID NO: 93 amino acid sequence of C. elegans unc- 

40, 

SEI' ID NO: 94 nucleotide sequence of C. elegans unc- 

40. 

SEQ ID NO: 95 amino acid sequence of human unc-40. 
SEQ ID NO: 96 nucleotide sequence of human unc-40. 

ACCESSION NUMBERS : 



Human bera-fodrin cDNA-GenBank S65762 
30 Human beta-fodrin protein-swissprot Q01082 
Human APC-1 cDNA-GenBank M74088 
Human APC-1 protein-swissprot P25054 
Human unc-14 cDNA (KIAA0375 ) -GenBank AB002373 



20 



35 
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Human unc-14 protein (KIAA0375 ) -BAA208 30 
Human ykl7a3 cDNA (KIAA0187 ) -GenBank D80009 
5 Human ykl7a3 protein ( KIAA0187 ) -SPTREMBL : Q14 692 



TABLE 1 : Schematic representation of dimerisations of C. elegans unc-5. using 
constructions in pAS2 and pGAD424 
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Claims : 

1. A protein comprising the sequence of amino 
acids set forth in SEQ ID NO: 2 or a sequence of amino 

5 acids which differs from that set forth in SEQ ID NO: 
2 only in conservative amino acid changes. 

2. A nucleic acid comprising a sequence of 
nucleotides which encodes the protein claimed in claim 

10 1. 

3. A nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 1 or a fragment 
thereof . 



15 



25 



30 



4. An expression vector comprising the nucleic 
acid of claim 2 or claim 3. 

5. A host cell or organism transformed or 
transfected with the expression vector of claim 4. 

6. An antibody which is capable of specifically 
binding to the protein claxmed in claim 1 or an 
epitope thereof. 

7. A protein comprising the sequence of amino 
acids set forth in SEQ ID NO: 4 or a sequence of amino 
acids which differs from that set forth in SEQ ID NO: 

4 only in conservative amino acid changes. 

8. A nucleic acid comprising a sequence of 
nucleotides which encodes the protein claimed in claim 
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9. A nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 3 or a fragment 
thereof . 
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10. An expression vector comprising the nucleic 
acid of claim 8 or claim 9. 

11. A host cell or organism transformed or 
transfected with the expression vector of claim 10. 

12. An antibody which is capable of specifically 
binding to the protein claimed in claim 7 or an 
epitope thereof . 

13. A protein comprising the sequence of amino 
acids set forth in SEQ ID NO: 6 or a sequence of amino 
acids which differs from that set forth in SEQ ID NO: 
c only in conservative amino acid changes. 

14 . A nucleic acid comprising a sequence of 
nucleotides which encodes the protein claimed in claim 
13 . 

0 15. A nucleic acid comprising the sequence of 

nucleotides set forth in SEQ ID NO: 5 or a fragment 
thereof . 

16. An expression vector comprising the nucleic 
5 acid of claim 13 or claim 14. 

17. A host cell or organism transformed or 
transfected with the expression vector of claim 16. 

10 18. A.n antibody which is capable of specifically 

binding to the protein claimed in claim 13 or an 
epitope thereof. 

19. A method of identifying compounds which are 
lb capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
identified as binding to the said UNC-5 protein, which 
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method comprises: 

providing a host cell containing a DNA 
construct comprising a reporter gene operatively 
linked to a promoter regulated by a transcription 
factor having a DNA binding domain and an 
activating domain ; 

expressing in said host cell a first hybrid 
DNA sequence encoding a first fusion protein 
comprising an UNC-5 protein or a fragment thereof 
fused in-frame to either the DNA binding domain 
or the activating domain of the said 
transcription factor; 

expressing in said host cell a second hybrid 
DNA sequence encoding a second fusion protein 
comprising an interacting protein or a fragment 
thereof fused in-frame to either the DNA binding 
domain or the activating domain of the said 
transcription factor, such that when the first 
fusion protein comprises the activation domain of 
the said transcription factor the second fusion 
protein comprises the DNA binding domain of the 
said transcription factor and when the first 
fusion protein comprises the DNA binding domain 
of the transcription factor the second fusion 
25 protein comprises the activation domain; 

contacting the host cell with a sample of 
the compound under test; and 

detecting any binding of the UNC-5 protein 
or fragment thereof to the interacting protein or 
fragment thereof by detecting the production of 
any reporter gene product in the said host cell. 



20 



30 



20. A method of identifying compounds which are 
capable of inhibiting or enhancing the binding of an 
35 UNC-5 protein to an interacting protein previously 

identified as binding to the said UNC-5 protein, which 
method comprises: 
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providing a transgenic cell or organism 
expressing a first fusion protein comprising an 
UNC-5 protein or a fragment thereof fused in- 
frame to a frrst genetically encoded fluorophore 
5 and a second fusion protein comprising an 

interacting protein or a fragment thereof fused 
in-frame to a second genetically encoded 
fluorophore, the first and second fluorophores 
being characterised in that the emission spectrum 
10 of one of the fluorophores overlaps with the 

absorption spectrum of the other fluorophore; 

measuring the amount of fluorescence emitted 
from the fluorophore having an emission spectrum 
which overlaps with the absorption spectrum of 
15 the other fluorophore; 

exposing the transgenic cell or organism to 
a compound under test; and 

detecting any change in the amount of 
fluorescence emitted fluorescence emitted from 
20 the fluorophore having an emission spectrum which 

overlaps with the absorption spectrum of the 
other fluorophore . 



21- A method of identifying compounds which are 
25 capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
identified as binding to the said UNC-5 protein, which 
method comprises: 

providing a first reaction component 
30 comprising a first protein linked to a solid 

support containing a scintillant and a second 
reaction component comprising a second protein 
which has been radioactively labelled, wherein 
the first and second proteins are an UNC-5 
35 protein or a fragment thereof and an interacting 

protein or a fragment thereof; 

bringing the first and second reaction 
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components into contact in an aqueous solution in 
the presence of a compound under tesr; and 

detecting binding of the first protein to 
the second protein by detecting light emission 
5 from the scintillant. 



22. A method of identifying compounds which are 
capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
10 identified as binding to the said UNC-5 protein, which 

method comprises: 

coating the wells of a microtiter plate with 
UNC-5 protein or a fragment thereof; 

contacting the UNC-5 protein or fragment 
15 thereof with an aqueous solution comprising an 

interacting protein or a fragment thereof, said 
interacting protein being labelled with a tag 
which is directly or indirectly detectable, and a 
compound under test; 
20 washing to remove the compound under test 

and any unbound tagged interacting protein; and 
detecting complexes of UNC-5 or a fragment 
thereof bound to the interacting protein or a 
fragment thereof by directly or indirectly 
25 detecting the presence of the tag. 



23. A method of identifying compounds which are 
capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
30 identified as binding to the said UNC-5 protein, which 
method comprises: 

exposing a cell or organism expressing UNC-5 
and overexpressing nucleic acid encoding an 
interacting protein to the compound under test; 
35 and 

screening for reversion of the 
overexpression phenotype of the cell or organism 
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to wild-type . 

24. A method as claimed in claim 23 wherein the 
organism is a nematode worm . 

5 

25. A method as claimed in claim 24 wherein the 
nematode worm is C. elegans. 

26. A method as claimed in claim 23 wherein the 
10 cell is a mammalian cell line, 

27. A method as claimed in any one of claims 23 
to 26 wherein the cell or organism further expresses a 
reporter gene encoding a reporter protein. 

15 

28. A method as claimed in claim 27 wherein the 
reporter protein is a fluorescent protein or a 
luminescent protein . 

20 29. A method as claimed in any one of claims 19 

to 28 wherein the UNC-5 protein is a C. elegans UNC-5 
protein. 

30 . A method as claimed in any one of claims 1 9 
25 to 28 wherein the UNC-5 protein is a human UNC-5 

protein . 

31. A method as claimed in claim 30 wherein the 
human UNC-5 protein is UNC-5C or a protein as claimed 

30 in any one of claims 1, 1, 13 or 71. 

32. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is a C. elegans 
UNC-5 protein or a fragment thereof. 
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33. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is C. elegans 
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UNC-40, 

34. A method as claimed in any one of claims 2S 
to 31 wherein the interacting protein is human UNC-40. 

35. A method as claimed in claim 34 wherein the 
UNC-40 protein comprises the sequence of amino acids 
sez. forth in SEQ ID NO: 95. 

36. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is a C. elegans 
spectrin p-cha i n /f odrin protein. 

37. A method as claimed in claim 36 wherein the 
spectrin- p-chain/f odrin protein comprises the sequence 
of amino acids set forth in SEQ ID NO: 12. 

36. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is C. elegans 
APR-1 . 

39. A method as claimed in claim 38 wherein the 
C. elegans APR-1 protein comprises the sequence of 
amino acids set forth in SEQ ID NO: 16. 

40. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is C. elegans 
UNC-14 . 

41. A method as claimed in claim 40 wherein the 
C. elegans UNC-14 protein com.prises the sequence of 
amino acids set forth in SEQ ID NO: 20. 

42. A m.ethod as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 24. 
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43. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 28. 

44. A merhod as claimed m any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of nucleotides set forth in SEQ ID NO: 32. 

45. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 36. 

46. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 40. 

47. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 44. 

48. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 46. 

49. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is a human UNC-5 
protein . 

50. A method as claimed in claim 49 wherein the 
human UNC-5 protein is UNC-5C or a protein as claimed 
in any one of claims 1, 1, 13 or 71. 

51. A method as claimed in any one of claim.s 29 
to 31 wherein the interacting protein is human i-beta- 
1, 3-N-acetylaminyltransf erase . 

52. A method as claimed in claim 51 wherein the 
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human i-beta-1, 3-N-acetylaminyltransf erase comprises 
the sequence of amino acids ser forth in SEQ ID NO: 
50. 

53. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises a 
protein encoded by the nucleic acid of claim 72 or 
claim 73. 

54. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises a 
protein encoded by the nucleic acid of claim 74 or 
claim 75. 

55. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is human alpha-2 
macroglobulin . 

56. A method as claimed in claim 55 wherein the 
alpha-2 macroglobulin comprises the sequence of amino 
acids set forth in SEQ ID NO: 59. 

57. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises a 
protein encoded by the nucleic acid of claim 76 or 
claim 77. 

58. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises a 
protein encoded by the nucleic acid of claim 78 or 
claim 79. 

59. A method of identifying compounds which 
reduce or inhibit the lethal phenotype associated with 
the expression of the UNC-5 death domain in yeast, 
which method comprises: 

exposing a yeast cell containing an 
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expression vector comprising nucleic acid 
encoding an UNC-5 prorein or a fragment thereof 
comprising the death domain to a compound under 
test ; 

5 allowing the yeast cells to grow in the 

presence of the compound; and 

screening for a reduction or inhibition of 
the lethal phenotype associated with the 
expression of the UNC-5 death domain in yeast. 

10 

60. A method as claimed in claim 59 wherein the 
UNC-5 prcnein is a C. eiegrans UNC-5 protein. 

61. A merhod as claimed in claim 59 wherein the 
15 UNC-5 protein is a human UNC-5 protein. 

62. A method as claimed in claim 61 wherein the 
human UNC-5 protein is a protein as claimed in any one 
of claims 1, 7 or 13 or 71. 

20 

63. A method of identifying suppressers of the 
lethal phenotype associated with the expression of the 
UNC-5 death domain in yeast, which method comprises: 

transfecting yeast cells containing an 
25 expression vector comprising nucleic acid 

encoding an UNC-5 protein or a fragment thereof 
comprising the death domain with a cDNA library 
cloned in a yeast expression vector; 

allowing the transfected yeast cells to grow 
30 for one or more cell divisions; and 

screening for reduction or inhibition of the 
lerhal phenotype associated with the expression 
of the UNC-5 death domain in yeast. 



35 



64. A method as claimed in claim 63, which 
method further comprises the steps of: 

identifying a transfected yeast cell 
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exhibiting a reduction or inhibition of the 
lethal phenotype associated with the expression 
of the UNC-5 death domain in yeast; and 

isolating the cDNA clone (s) present in the 
transfected yeast cell which is/are responsible 
for conferring reduction or inhibition of the 
lethal phenotype. 

65. A method as claimed in claim 63 or claim 64 
v;rerein the UNC-5 protein is a C, eleqans UNC-5 

p r C' t e i n . 

66. A method as claimed in claim 63 or claim 64 
wr:^rein the UNC-5 protein is a human UNC-5 protein. 

67. A method as claimed in claim 66 wherein the 
human UNC-5 protein is a protein as claimed in any one 
of claims 1, 7, 13 or 71. 

68. A method as claimed in claim 65 wherein the 
cZi:?-. library is a C. elegans cDNA library. 

69. A method as claimed in claim 66 or claim 67 
v;nerein the cDNA library is a human cDNA library, 

70. A nucleic acid com.prising the sequence of 
nucleotides set forth in SEQ ID NO: 7. 

71. A protein comprising a sequence of amino 
acids encoded by the nucleic acid molecule of claim 8. 

72. A nucleic acid which is obtainable by 
restriction enzyme digestion of the plasmid pYMP17 
wirh the restriction enzymes EcoRI and Xhol . 

73. A nucleic acid as claimed in claim 72 which 
comprises the sequence of nucleotides set forth in SEQ 
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ID NO: 56 and a sequence of nucleotides complementary 
to the sequence of nucleotides set forth in SEQ ID NO: 
57. 



74, A nucleic acid which is obtainable by 
restriction enzyme digestion of the plasmid pYMP6 with 
the restriction enzymes EcoRI and Xhol . 

75. A nucleic acid as claimed in claim 74 which 
comprises the sequence of nucleotides set forth in SEQ 
ID NO; 54 and a sequence of nucleotides complementary 
to the sequence of nucleotides set forth in SEQ ID NO: 
55. 



76. A nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 61 and a sequence 
of nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 62. 



77. A nucleic acid as claimed in claim 76 which 
is obtainable by restriction enzyme digestion of the 
plasmid pYMPll with the restriction enzymes EcoRI and 
Xhol. 



78. A nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 63 and a sequence 
of nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 64. 



79. A nucleic acid as claimed in claim 78 v;hich 
is obtainable by restriction enzyme digestion of the 
plasmid pYMP12 with the restriction enzymes EcoRI and 
Xhol. 
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80. A nucleic acid probe which is capable of 
hybridizing to the nucleic acid of claim 70 under 
conditions of high stringency. 
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81- An oligonucleotide comprising a sequence of 
10 or more consecutive nucleotides of the sequence of 
nucleotides set forth in SEQ ID NO: 7. 

5 82, An antisense nucleic acid which is capable 

of hybridizing to the sequence of nucleotides set 
forth in SEQ ID NO: 7 under conditions of high 
st rrngency . 

10 83. An expression vector comprising the nucleic 

acid of claim 70. 

84. A host cell or organism transformed or 
transfected with the expression vector of claim 83. 

15 

85. An antibody which is capable of specifically 
binding to the protein claimed in claim 71. 
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Name : 


UNC5C 


Len : 


1599 


Check : 


410 


Weight : 


0 


.76 


Name : 


UNC5C8 


Len: 


1599 


Check: 


1710 


Weight : 


0 


.76 


Name : 


UNC5CC 


Len : 


1599 


Check : 


5512 


Weight : 


1 


12 


Name : 


UNC5Cd(UNC5Cb) 


Len : 


1599 


Check : 


1388 


Weight : 


1 


37 


Name : 


Consensus 


Len : 


1599 


Check : 


7845 


Weight : 


4 


00 



UNC5C 
UNC5C8 
UNC5CC 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



LJNC5C 
UNC5C8 
UNC5CC 
UNCSCd 
Consensus 



UNC5C 
IJNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNC5Cd 
Consensus 



TTGTTTGTGT 
TTGTTTGTGT 
TTGTTTGTGT 
TTGTTTGTGT 
TTGTTTGTGT 

51 

CTCTTCGGCA 
CTCTTCGGCA 
CTCTTCGGCA 
CTCTTCGGCA 
CTCTTCGGCA 



ATCGGAAGAA 
ATCGGAAGAA 
ATCGGAAGAA 
ATCGGAAGAA 
ATCGGAAGAA 



CTCAATGGGG 
CTCAATGGGG 
CTCAATGGGG 
CTCAATGGGG 
CTCAATGGGG 



TCATCGTGAC 
TCATCGTGAC 
TCATCGTGAC 
TCATCGTGAC 
TCATCGTGAC 



GCTTTCAGCC 
GCTTTCAGCC 
GCTTTCAGCC 
GCTTTCAGCC 
GCTTTCAGCC 



TTTGAGTCAG 
TTTGAGTCAG 
TTTGAGTCAG 
TTTGAGTCAG 
TTTGAGTCAG 



TGTGAACATC 
TGTGAACATC 
TGTGAACATC 
TGTGAACATC 
TGTGAACATC 



50 

ATATTATTGA 
ATATTATTGA 
ATATTATTGA 
ATATTATTGA 
ATATTATTGA 

100 

AAGGCAGCAA 
AAGGCAGCAA 
AAGGCAGCAA 
AAGGCAGCAA 
AAGGCAGCAA 



150 

GACAAGATCT GCTGGCTGTA CCCCCAGACC TCACGTCAGC TGCAGCCATG 

GACAAGA CC TCACGTCAGC TGCAGCCATG 

GACAAGATCT GCTGGCTGTA CCCCCAGACC TCACGTCAGC TGCAGCCATG 
GACAAGATCT GCTGGCTGTA CCCCCAGACC TCACGTCAGC TGCAGCCATG 
GACAAGAtct gctggctgta cccccagaCC TCACGTCAGC TGCAGCCATG 



151 

TACAGAGGAC 
TACAGAGGAC 
TACAGAGGAC 
TACAGAGGAC 
TACAGAGGAC 

201 

GACCAACTCT 
GACCAACTCT 
GACCAACTCT 
GACCAACTCT 
GACCAACTCT 



CTGTCTATGC 
CTGTCTATGC 
CTGTCTATGC 
CTGTCTATGC 
CTGTCTATGC 



CCAATTCTGG 
CCAATTCTGG 
CCAATTCTGG 
CCAATTCTGG 
CCAATTCTGG 



CCTGCATGAC 
CCTGCATGAC 
CCTGCATGAC 
CCTGCATGAC 
CCTGCATGAC 



ATCCACTGCC 
ATCCACTGCC 
ATCCACTGCC 
ATCCACTGCC 
ATCCACTGCC 



GTCTCAGACA 
GTCTCAGACA 
GTCTCAGACA 
GTCTCAGACA 
GTCTCAGACA 



CAACCTGAAA 
CAACCTGAAA 
CAACCTGAAA 
CAACCTGAAA 
CAACCTGAAA 



200 

AAATCCCAAT 
AAATCCCAAT 
AAATCCCAAT 
AAATCCCAAT 
AAATCCCAAT 

250 

ATCAAAGTGT 
ATCAAAGTGT 
ATCAAAGTGT 
ATCAAAGTGT 
ATCAAAGTGT 



251 300 

ACAACACCTC AGGTGCTGTC TCCCCCCAAG ATGACCTCTC TGAGTTTACG 

ACAACACCTC AGGTGCTGTC ACCCCCCAAG ATGACCTCTC TGAGTTTACG 

ACAACACCTC AGGTGCTGTC ACC 

ACAACACCTC AAGTGCTGTC ACCCCCCAAG ATGACCTCTC TGAGTTTACG 

ACAACACCTC AgGTGCTGTC aCCccccaag atgacctctc tgagtttacg 

301 350 
TCCAAGCTGT CCCCTCAGAT GACCCAGTCG TTGTTGGAGA ATGAAGCCCT 
TCCAAGCTGT CCCCTCAGAT GACCCAGTCG TTGTTGGAGA ATGAAGCCCT 

TCCAAGCTGT CCCCTCAGAT GACCCAGTCG TTGTTGGAGA ATGAAGCCCT 
tccaagctgt cccctcagat gacccagtcg ttgttggaga atgaagccct 
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351 400 
UNC5C CAGCCTGAAG AACCAGAGTC TAGCAAGGCA GACTGATCCA TCCTGTACCG 
UNC5C8 CAGCCTGAAG AACCAGAGTC TAGCAAGGCA GACTGATCCA TCCTGTACCG 

UNCSCc 

UNCSCd CAGCCTGAAG AACCAGAGTC TAGCAAGGCA GACTGATCCA TCCTGTACCG 
Consensus cagcctgaag aaccagagtc tagcaaggca gactgatcca tcctgtaccg 

401 450 

UNC5C CATTTGGCAG CTTCAACTCG CTGGGAGGTC ACCTTATTGT TCCCAATTCA 

UNC5C8 CATTTGGCAG CTTCAACTCG CTGGGAGGTC ACCTTATTGT TCCCAATTCA 

UNCSCc TATTGT TCCCAATTCA 

UNCSCd CATTTGGCAG CTTCAACTCG CTGGGAGGTC ACCTTATTGT TCCCAATTCA 

Consensus catttggcag cttcaactcg ctgggaggtc acctTATTGT TCCCAATTCA 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



451 

GGAGTCAGCT 
GGAGTCAGCT 
GGAGTCAGCT 
GGAGTCAGCT 
GGAGTCAGCT 



TGCTGATTCC 
TGCTGATTCC 
TGCTGATTCC 
TGCTGATTCC 
TGCTGATTCC 



CGCTGGGGCC 
CGCTGGGGCC 
CGCTGGGGCC 
CGCTGGGGCC 
CGCTGGGGCC 



ATTCCCCAAG 
ATTCCCCAAG 
ATTCCCCAAG 
ATTCCCCAAG 
ATTCCCCAAG 



500 

GGAGAGTCTA 
GGAGAGTCTA 
GGAGAGTCTA 
GGAGAGTCTA 
GGAGAGTCTA 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



501 

CGAAATGTAT 
CGAAATGTAT 
CGAAATGTAT 
CGAAATGTAT 
CGAAATGTAT 



GTGACTGTAC 
GTGACTGTAC 
GTGACTGTAC 
GTGACTGTAC 
GTGACTGTAC 



ACAGGAAAGA 
ACAGGAAAGA 
ACAGGAAAGA 
ACAGGAAAGA 
ACAGGAAAGA 



AACTATGAGG 
AACTATGAGG 
AACTATGAGG 
AACTATGAGG 
AACTATGAGG 



550 

CCACCCATGG 
CCACCCATGG 
CCACCCATGG 
CCACCCATGG 
CCACCCATGG 



UNC5C 
UNCSC8 
UNCSCc 
UNCSCd 
Consensus 



551 

ATGACTCTCA 
ATGACTCTCA 
ATGACTCTCA 
ATGACTCTCA 
ATGACTCTCA 



GACACTTTTG 
GACACTTTTG 
GACACTTTTG 
GACACTTTTG 
GACACTTTTG 



ACCCCTGTGG 
ACCCCTGTGG 
ACCCCTGTGG 
ACCCCTGTGG 
ACCCCTGTGG 



TGAGCTGTGG 
TGAGCTGTGG 
TGAGCTGTGG 
TGAGCTGTGG 
TGAGCTGTGG 



600 

GCCCCCAGGA 
GCCCCCAGGA 
GCCCCCAGGA 
GCCCCCAGGA 
GCCCCCAGGA 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



601 

GCTCTGCTCA 
GCTCTGCTCA 
GCTCTGCTCA 
GCTCTGCTCA 
GCTCTGCTCA 



CCCGCCCCGT 
CCCGCCCCGC 
CCCGCCCCGT 
CCCGCCCCGT 
CCCGCCCCGt 



CGTCCTCACT 
CGTCCTCACT 
CGTCCTCACT 
CGTCCTCACT 
CGTCCTCACT 



ATGCATCACT 
ATGCATCACT 
ATGCATCACT 
ATGCATCACT 
ATGCATCACT 



650 

GCGCAGACCC 
GCGCAGACCC 
GCGCAGACCC 
GCGCAGACCC 
GCGCAGACCC 



UNCSC 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



651 700 
CAATACCGAG GACTGGAAAA TACTGCTCAA GAACCAGGCA GCACAGGGAC 
CAATACCGAG GACTGGAAAA TACTGCTCAA GAACCAGGCA GCACAGGGAC 
CAATACCGAG GACTGGAAAA TACTGCTCAA GAACCAGGCA GCACAGGGAC 

CAATACCGAG GACTGGAAAA TACTGCTC-- 

CAATACCGAG GACTGGAAAA TACTGCTCaa gaaccaggca gcacagggac 

701 750 
AGTGGGAGGA TGTGGTGGTG GTCGGGGAGG AAAACTTCAC CACCCCCTGC 
AGTGGGAGGA TGTGGTGGTG GCCGGGGAGG AAAACTTCAC CACCCCCTGC 
AGTGGGAGGA TGTGGTGGTG GTCGGGGAGG AAAACTTCAC CACCCCCTGC 

agtgggagga tgtggtggtg g cggggagg aaaacttcac caccccctgc 



UNCSC 
UNCSC8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



751 

TACATTAAGC 
TACATTAAGC 
TACATTCAGC 



TGGATGCAGA 
TGGATGCAGA 
TGGATGCAGA 



GGCCTGCCAC 
GGCCTGCCAC 
GGCCTGCCAC 



ATCCTCACAG 
ATCCTCACAG 
ATCCTCACAG 



800 

AGAACCTCAG 
AGAACCTCAG 
AGAACCTCAG 



tacatt age tggatgcaga ggcctgccac atcctcacag agaacctcag 



801 

CACCTACGCC 
CACCTACGCC 
CACCTACGCC 



CTGGTAGGAC 
CTGGTAGGAC 
CTGGTAGGAC 



ATTCCACCAC 
ATTCCACCAC 
ATTCCACCAC 



CAAAGCGGCT 
CAAAGCGGCT 
CAAAGCGGCT 



850 

GCAAAGCGCC 
GCAAAGCGCC 
GCAAAGCGCC 



cacctacgcc ctggtaggac attccaccac caaagcggct gcaaagcgcc 



851 

TCAAGCTGGC 
TCAAGCTGGC 
TCAAGCTGGC 



CATCTTTGGG 
CATCTTTGGG 
CATCTTTGGG 



CCCCTGTGCT 
CCCCTGTGCT 
CCCCTGTGCT 



tcaagctggc catctttggg cccctgtgct 



GCTCCTCGCT 
GCTCCTCGCT 
GCTCCTCGCT 

CTCGCT 

gctcCTCGCT 



900 

GGAGTACAGC 
GGAGTACAGC 
GGAGTACAGC 
GGAGTACAGC 
GGAGTACAGC 
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UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNCSC8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNCSC8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNCSC8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNCSC8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNCSC8 
UNCSCc 
UNCSCd 
Consensus 



UNCSC 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



901 

ATCCGAGTCT ACTGTCTGGA 
ATCCGAGTCT ACTGTCTGGA 
ATCCGAGTCT ACTGTCTGGA 
ATCCGAGTCT ACTGTCTGGA 
ATCCGAGTCT ACTGTCTGGA 



951 

ACATCTTGAG 
ACATCTTGAG 
ACATCTTGAG 
ACATCTTGAG 
ACATCTTGAG 

1001 

TTCATTTTAA 
TTCATTT-AA 
TTCATTTTAA 
TTCATTTTAA 
TTCATTTtAA 

1051 

GCCCATTCCC 
GCCCATTCCC 
GCCCGTTCCC 
GCCCATTCCC 
GCCCaTTCCC 

1101 

ATTTTACCAT 
ATTTTACCAT 
ATTTTACCAT 
ATTTTACCAT 
ATTTTACCAT 

1151 

CTCTGGAAAG 
CTCTGGAAAG 
CTCTGGAAAG 
CTCTGGAAAG 
CTCTGGAAAG 

1201 

GTGCGGCAGG 
GT-CGGCAGG 
GTGCGGCAGG 
GTGCGGCAGG 
GTgCGGCAGG 

1251 

GTCAGAGGAA 
GTCAGAGGAA 
GTCAGAGGAA 
GTCAGAGGAA 
GTCAGAGGAA 

1301 

CCATCACCAC 
CCATCACCAC 
CCATCACCAC 
CCATCACCAC 
CCATCACCAC 

1351 

CGGCAGAAGC 
CGGCAGAAGC 
CGGCAGAAGC 
CGGCAGAAGC 
CGGCAGAAGC 



AGACAGACGG 
AGAXXXXXXX 
AGACAGACGG 
AGACAGACGG 
AGAcagacgg 



AGGCAGCACC 
AGCANGCANC 
AGGCAGCACC 
AGGCAGCACC 
AGgcaGCAcC 



TCTGGAAGAG 
TCTGAAAGAG 
TCTGGAAGAG 
TCTGGAAGAG 
TCTGgAAGAG 



GTTTGGAGTG 
GTTTGGAGTG 
GTTTGGAGTG 
GTTTGGAGTG 
GTTTGGAGTG 



ATTTAGCCTG 
ATTTAGCCTG 
ATTTAGCCTG 
ATTTAGCCTG 
ATTTAGCCTG 



TGGAAGGAGA 
TGGAAGGAGA 
TGGAAGGAGA 
TGGAAGGAGA 
TGGAAGGAGA 



CCTACTGGCA 
CCTACTGGCA 
CCTACTGGCA 
CCTACTGGCA 
CCTACTGGCA 



GGTCACGGGG 
GGTCACGGGG 
GGTCACGGGG 
GGTCACGGGG 
GGTCACGGGG 



TCTGTAGCAG 
TCTGTAGCAG 
TCTGTAGCAG 
TCTGTAGCAG 
TCTGTAGCAG 



1401 

CTGGAGGATG CTGGCCCATA 

CTGGAGGATG CTGGCCCATA 

CTGGAGGATG CTGGCCCATA 

CTGGAGGATG CTGGCCCATA 

CTGGAGGATG CTGGCCCATA 

SUBSTITUTE SHEET 



TGACACCCAG 
TGACACCCAG 
TGACACCCAG 
TGACACCCAG 
TGACACCCAG 



GAGGACAGCT 

xxxxxxxxxx 

GAGGACAGCT 
GAGGACAGCT 
gaggacagct 



CACAACCTGC 
CNNCAAATGN 
CACAACCTGC 
CACAACCTGC 
CacaAccTGc 



CAAATTGCTG 
CAAATTGCTG 
CAAATTGCTG 
CAAATTGCTG 
CAAATTGCTG 



GATCTCAAAG 
GATCTCAAAG 
GATCTCAAAG 
GATCTCAAAG 
GATCTC7UVAG 



AACACAGTGG 
AACACAGTGG 
AACACAGTGG 
AACACAGTGG 
AACACAGTGG 



AGGGCAGATC 
AGG-CAGATC 
AGGGCAGATC 
AGGGCAGATC 
AGGgCAGATC 



TCGATTTGCC 
TCGATTTGCC 
TCGATTTGCC 
TCGATTTGCC 
TCGATTTGCC 



CCCAGTGCTT 
CCCAGTGCTT 
CCCAGTGCTT 
CCCAGTGCTT 
CCCAGTGCTT 



CCTGGATGCC 
CCTGGATGCC 
CCTGGATGCC 
CCTGGATGCC 
CCTGGATGCC 



AGCTGAACCT 
AGCTGAACCT 
AGCTGAACCT 
AGCTGAACCT 
AGCTGAACCT 

(RULE 26) 



GATGCCCTGA 
GATGCCCTGA 
GGTGCCCTGA 
GATGCCCTGA 
GaTGCCCTGA 



CCTAGAAGAA 
XXXXXXXXXX 
CCTAGAAGAA 
CCTAGAAGAA 
cctagaagaa 



GCCTGTCAAT 
GCCTGTCAAT 
GCCTGTCAAT 
GCCTGTCAAT 
GCCTGTCAAT 



GCTAAATATC 
GCTAAATATC 
GCTAAATATC 
GCTAAATATC 
GCTAAATATC 



AAACCTGCAC 
AANCCTGCAC 
AAACCTGCAC 
AAACCTGCAC 
AAaCCTGCAC 



AGCTGGTTTG 
AGCTGGTTTG 
AGCTGGTTTG 
AGCTGGTTTG 
AGCTGGTTTG 



TTCCAGCTCA 
TTCCAGCTCA 
TTCCAGCTCA 
TTCCAGCTCA 
TTCCAGCTCA 



GCTGCTGGAT 
GCTGCTGGAT 
GCTGCTGGAT 
GCTGCTGGAT 
GCTGCTGGAT 



TCAGCATCCC 
TCAGCATCCC 
TCAGCATCCC 
TCAGCATCCC 
TCAGCATCCC 



CCCCAGACGA 
CCCCAGACGA 
CCCCAGACGA 
CCCCAGACGA 
CCCCAGACGA 



GGACAGGTAC 
GGACGGGTAC 
GGACAGGTAC 
GGACAGGTAC 
GGACaGGTAC 



950 

AGGAAATTTT 
AGGAAATTTT 
AGGAAATTTT 
AGGAAATTTT 
AGGAAATTTT 

1000 
CCTAAGGCTC 
XXXXAGGTTT 
CCTAAGGCTC 
CCTAAGGCTC 
cctaAGGcTc 

1050 
TCACGATATC 
TCNCGATATG 
TCACGATATC 
TCACGATATC 
TCaCGATATc 

1100 
AGGAAATTCC 
AGGAAATTCC 
AGGAAATTCC 
AGGAAATTCC 
AGGAAATTCC 

1150 
TGCACCTTCA 
TGCACNTTCA 
TGCACCTTCA 
TGCACCTTCA 
TGCACCTTCA 

1200 
CAAACTCTGT 
CAAACTCTGT 
CAAACTCTGT 
CAAACTCTGT 
CAAACTCTGT 

1250 
ACTGCACCGT 
ACTGCACCGT 
ACTGCACCGT 
ACTGCACCGT 
ACTGCACCGT 

1300 
CCTGCGAACA 
CCTGCGAACA 
CCTGCGAACA 
CCTGCGAACA 
CCTGCGAACA 

1350 
TCTCCCTATC 
TCTCCCTATC 
TCTCCCTATC 
TCTCCCTATC 
TCTCCCTATC 

1400 
GAGGCCATGA 
GAGGCCATGA 
GAGGCCATGA 
GAGGCCATGA 
GAGGCCATGA 

1450 
TTGAATTACT 
TTGAATTACT 
TTGAATTACT 
TTGAATTACT 
TTGAATTACT 
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UNC5C 
UNC5C8 
UNC5CC 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNC5CC 
UNCSCd 
Consensus 



UNC5C 
UNCSCe 
UNC5CC 
UNCSCd 
Consensus 



1451 

TTGCCACCAA 
TTGCCACCAA 
TTGCCACCAA 
TTGCCACCAA 
TTGCCACCAA 

1501 

CAGAACTTCC 
CAGAACTTCC 
CAGAACTTCC 
CAGAACTTCC 
CAGAACTTCC 

1551 

AATGGGAAGA 
AATGGGAAGA 
AATGGGAAGA 
AATGGGAAGA 
AATGGGAAGA 



ATCCAGCCCA 
ATCCAGCCCA 
ATCCAGCCCA 
ATCCAGCCCA 
ATCCAGCCCA 



CAGATGGAAA 
CAGATGGAAA 
CAGATGGAAA 
CAGATGGAAA 
CAGATGGAAA 



CATGAAACGG 
CATGAAACGG 
CATGAAACGG 
CATGAAACGG 
CATGAAACGG 



ACTGGCGTAA 
ACTGGCGTAA 
ACTGGCGTAA 
ACTGGCGTAA 
ACTGGCGTAA 



CCTGAGCATG 
CCTGAGCATG 
CCTGAGCATG 
CCTGAGCATG 
CCTGAGCATG 



TGGTGTCCTT 
TGGTGTCCTT 
TGGTGTCCTT 
TGGTGTCCTT 
TGGTGTCCTT 



TCCTGGATCT 
TCCTGGATCT 
TCCTGGATCT 
TCCTGGATCT 
TCCTGGATCT 



CTGGCAGCTG 
CTGGCAGCTG 
CTGGCAGCTG 
CTGGCAGCTG 
CTGGCAGCTG 



AGCAGCAGAA 
AGCAGCAGAA 
AGCAGCAGAA 
AGCAGCAGAA 
AGCAGCAGAA 



1500 
TTGGGAAGCA 
TTGGGAAGCA 
TTGGGAAGCA 
TTGGGAAGCA 
TTGGGAAGCA 

1550 
TCTTGGAAGA 
TCTTGGAAGA 
TCTTGGAAGA 
TCTTGGAAGA 
TCTTGGAAGA 

1599 
GGGCAGTAT 
GGGCAGTAT 
GGGCAGTAT 
GGGCAGTAT 
GGGCAGTAT 
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Multalin version 5.3.3 

Copyright I.N.R.A. France 1989, 1991, 1994, 1996 

Published research using this software should cite 

Multiple sequence alignment with hierarchical clustering 

F. CORPET, 1988, Nucl . Acids Res., 16 22, 10881-10890 

Symbol comparison table : blosum62 

Gap weight : 12 

Gap length weight : 2 

Consensus levels: high=90% low=50% 

Consensus symbols: 

! i s anyone of IV 

$ is anyone of LM 

% is anyone of FY 

# is anyone of NDQEBZ 



MSF: 2908 Check: 0 



Name : 


ratuncBhl 


Len 


2908 


Check 


8912 


Weight - 


0 


. 87 


Name : 


ym97dl2 


Len 


2908 


Check 


4745 


Weight - 


0 


. 87 


Name : 


IG 


Len 


2908 


Check 


1058 


Weight 


1 


. 05 


Name : 


IJrc 


Len 


2908 


Check 


508 


Weight 


1 


. 04 


Name : 


2Brc 


Len 


2908 


Check 


6768 


Weight 


: 1 


.04 


Name : 


3D 


Len 


2908 


Check 


8193 


Weight 


: 1 


. 13 


Name : 


Consensus 


Len 


2908 


Check 


6031 


Weight 


: 6 


.00 



// 



1 50 
ratuncShl ATGGCCGTCC GGCCCGGCCT GTGGCCAGTG CTCCTGGGCA TAGTCCTCGC 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



51 100 
ratunc5hl CGCCTGGCTT CGTGGTTCGG GTGCCCAGCA GAGTGCCACG GTGGCCAATC 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



101 150 
ratuncShl CAGTGCCCGG TGCCAACCCC GACCTGCTGC CCCACTTCCT GGTAGAGCCT 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



151 

GAGGACGTGT 



ACATTGTCAA GAACAAGCCG GTGTTGTTGG 



200 

TGTGCAAGGC 



SUBSTITUTE SHEET (RULE 26) 
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201 250 
ratuncShl TGTGCCTGCC ACCCAGATCT TCTTCAAGTG CAATGGGGAA TGGGTCCGCC 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 

300 

ratuncShl AGGTCGATCA CGTAATTGAA CGCAGCACCG ACAGCAGCAG CGGATTGCCA 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 

350 

ratuncShl ACCATGGAGG TCCGTATCAA CGTATCGAGG CAGCAGGTAG AGAAAGTGTT 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 

400 

ratuncShl TGGGCTGGAG GAATACTGGT GCCAGTGTGT GGCATGGAGC TCCTCGGGTA 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 

401 450 
ratuncShl CCACCAAAAG TCAGAAGGCC TACATCCGGA TTGCCTATTT GCGCAAGAAC 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 

' 500 
ratuncShl TTTGAGCAGG AGCCACTGGC CAAGGAAGTG TCACTGGAGC AAGGCATTGT 
yni97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 

. 501 550 
ratuncShl ACTACCTTGT CGCCCCCCAG AAGGAATCCC CCCAGCTGAG GTGGAGTGGC 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 
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wo 00/73328 



PCT/EPOO/05108 



.7/18 



ratuncShl 
yTti97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



551 

TTCGAAkTGA 



GGACCTCGTG GACCCCTCCC TCGATCCCAA 



600 

TGTGTACATC 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



601 650 
ACGCGGGAGC ACAGCCTAGT CGTGCGTCAG GCCCGCCTGG CCGACACGGC 



ratuncBhl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



651 700 
CAACTACACC TGTGTGGCCA AGAACATCGT AGCCCGTCGC CGAAGCACCT 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



701 750 
CTGCAGCGGT CATTGTTTAT GTGAACGGTG GGTGGTCGAC GTGGACTGAG 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



751 

TGGTCCGTCT 



GCAGCGCCAG CTGTGGGCGT GGCTGGCAGA 



800 

AACGGAGCCG 



ratunc5hl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



801 850 
GAGCTGCACC AACCCGGCAC CTCTCAACGG GGGCGCCTTC TGTGAGGGGC 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



851 

AGAATGTCCA 



GAAAACAGCC TGCGCCACTC TGTGCCCAGT 



SUBSTITUTE SHEET (RULE 26) 
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GGATGGGAGC 
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ratuncShl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



901 



950 



TGGAGTTCGT GGAGTAAGTG GTCAGCCTGT GGGCTTGACT GCACCCACTG 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



951 



1000 



^CGQ^KdCCQ^Q GAGTGCTCTG ACCCAGCACC CCGCAATGGA GGTGAGGAGT 



1001 



ratuncShl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



1050 



GTCGGGGTGC TGACCTGGAC ACCCGCAACT GTACCAGTGA CCTCTGCCTG 



CAGTGA CCTCTGTGTA 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



3;^^^ 1100 

CACACCGCTT CTTGCCCCGA GGACGTGGCT CTCTACATCG GCCTTGTCGC 



CACACTGCTT CTGGCCCTGA GGACGTGGCC CTCTATGTGG GCCTNATCGC 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



Predicted transmembrane region 

1101 1150 
TGTGGCTGTG TGCCTCTTCT TGCTGTTGCT GGCCCTTGGA CTCATTTACT 



CGTGGCCGNN TGCCTGGTCC TGCTGCTGCT TGTCCTCATC CTCGTTTATT 
tt c g cc ccctta 

1151 1200 
STCGCAAGAA GGAAGGGCTG GACTCCGATG TGGCCGACTC GTCCATCCTC 



GCCGGAAGAA GGAGGGGCTG GACTCANATG TGGCTGACTC GTCCATTCTC 
gcc aa gg g ga g t c ga c t t tc 

1201 1250 
ACCTCGGGCT TCCAGCCTGT CAGCATCAAG CCCAGCAAAG CAGACAACCC 



ACCTCAGGCT TCCAGCCCGT CAGCATCAAG CCCAGCAAAG CAGACAACCC 
cc a t t g cc t age a ca 



g c 



cc 



SUBSTITUTE SHFFT mi il F 9fi> 
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ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncBhl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
yin97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



12S1 1300 
CCACCTGCTC ACCATCCAGC CAGACCTCAG CACCACCACT ACCACCTACC 



CCTTGGGTTC -CCNTCAAGT GGTNNCANGG GGGTGGCCCT TGAA--TTCA 
ACTTGGGTTC -CCNTCAAGT TGT--CAATG GGGNGCCCCT --GA--ATCA 
CCATCTGCTC ACCATCCAGC CGGACCTCAG CACCACCACC ACCACCTACC 
cc t g tc cc tc ag g c g cc c ate 

1301 1350 
AGGGCAGTCT ATGTTCGAGG CAGGATGGAC CCAGCCCCAA GTTCCAGCTC 



AGGGCAGTCT NTGTCCCCGG CAGGATGGGC CCAGCCCCAA GTTCCAGCTC 
ag at tgt gg gg tgg c age c ccag 

13S1 1400 
TCTAATGGTC ACCTGCTCAG CCCACTGGGG AGTGGCCGCC ATACGTTGCA 

GCC ACAC--TGCA 
TCAG CCCCCTGGGT GGCGGCCGCC ACACACTGCA 



ACCAATGGGC ACCTGCTCAG CCCCCTGGGT GGCGGCCGCC ACACACTGCA 
aa g c cct tcag ccc cctggg g ggccgCC acac tGCA 

1401 1450 

CCACAGCTCA CCCACCTCTG AGGCTGAGGA CTTCGTCTCC CGCCTCTCCA 

CCACAGCTCT CCCACCTCTG AGGCCGAGGA GTTCGTCTCC CGCCTCTCCA 

CCACAGCTCT CCCACCTCTG AGGCCGAGGA GTTCGTCTCC CGCCTCTCCA 



CCAACCTNTG AGGCCNAGGA GTTCGNNTCC 
cCcacctctG aggcc AGGa gttCg tec 



CCACAGCTCT 
cCacagCtct 

14S1 

CCCAAAACTA 
CCCAGAACTA 
CCCAGAACTA 



CTT - TCGTTC 
CTT-CCGCTC 
CTT-CCGCTC 



CCTGCCCCGC 
CCTGCCCCGA 
CCTGCCCCGA 



GGCACCAGCA 
GGCACCAGCA 
GGCACCAGCA 



CGCCTTTCCA 
cGccT Tcca 

1500 
ACATGGCCTA 
ACATGACCTA 
ACATGACCTA 



CCCAGAACTA CTTNCGGTTC CTTGCCCCCA GGCNCCAGCA ACATGACCTT 
cccagaacTa ctT cgGttC ctTgccCcga GGc ccagca acAtGaCCT 



1501 

C--GGGACCT 
T--GGGACCT 
T--GGGACCT 



TCA-ACTTCC 
TCA-ACTTCC 
TCNNACTTCC 



TCGGGGG-CC 
TCGGGGG-CC 
TCGGGGG-CC 



GGCTGATGAT 
GGCTGATGAT 
GGCTGATGAT 



1550 
--CCCTAATA 
- -CCCTAATA 
--CCCTAATA 



ATGGGGACCT TTAAATTTCT TCGGGGGNCC GGNTTATGAA NCCCTAATTC 

gGGaCCT t acTTCc TcggggG CC Gg t atga cc atTc 

1551 1600 

CGGGGA--TC AGCCTCCT-C ATACCCCCGG ATGCCATCCC CC-GAGGAAA 

CAGGAA--TC AGCCTCCT-C ATCCCCCCAG ATGCCATACC CC-GAGGGAA 

CAGGAA--TC AGCCTCCT-C ATNCCCCCAG ATGCCATACC CC-GAGGGAA 

CAGGAA--TC AGCCTCCT-C ATCCCCCCAG ATGCCATACC CC-GAGGGAA 

CAGGAA--TC AGCCTCCT-C ATCCCCCCAG ATGCCATACC CC-GAGGGAA 

CAGGGAATTA AACCTTCTTA ATCCCCCCAA ATGCCANACC CCCGANGGAA 

CaGGaA Tc AgCCTcCT c ATcCCCCCag ATGCCAtaCC CC GAgGgAA 

n iRRimrri^ rhect (rule 26) 
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ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



rat uncBhl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



1601 

GATCT-ACGA 
GATCT-ATGA 
GATCT-ATGA 
GATCT-ATGA 
GATCT-ATGA 
NATCTNTTGN 
gATCT atGa 

1651 

CCCCTAGCTG 
CCCCTAGCTG 
CCCCTAGCTG 
CCCCTAGCTG 
CCCCTAGCTG 
AACC 

cccctagctg 



GATCTACCTC 
GATCTACCTC 
GATCTGCCTC 
GATCTACCTC 
GATCTACCTC 
NAACTACCTT 
gAtCTaCCTc 



GCTGTCAGAC 
GCTGTCAGAC 
GCTGTCAGAC 
GCTGTCAGAC 
GCTGTCAGAC 



ACACTGCACA 
ACGCTGCACA 
ACGCTGCACA 
ACGCTGCACA 
ACGCTGCGCA 

A A 

AcgctgcacA 



CCTGCTGAGT 
CCTGCTGAGT 
CCTGCTGAGT 
CCTGCTGAGT 
CCTGCTGAGT 



AGCCAGAAGA 
AGCCGGAAGA 
AGCCGGAAGA 
AGCCGGAAGA 
AGCCGGAAGA 
ANCTTGANNA 
AgCcgGAagA 



CCAGTCGTTA 
CCCATCGTTA 
CCCATCGTTA 
CCCATCGTTA 
CCCATCGTTA 



1650 
CGTGAGGTTG 
CGTGAGGTTG 
CGTGAGGTTG 
CGTGAGGTTG 
CGTGAGGTTG 
AGCCCGGAAA 
cGtgaGGttg 

1700 
GCTGTGGGCC 
GCTGTGGACC 
GCTGTGGACC 
GCTGTGGACC 
GCTGTGGACC 



gctgtcagac cctgctgagt cccatcgtta gctgtggacc 



1701 

CCCA-GGAGT CCTGCTCACC 
CCCT-GGCGT CCTGCTCACC 
CCCT-GGCGT CCTGCTCACC 
CCCT-GGCGT CCTGCTCACC 
CCCT-GGCGT CCTGCTCACC 



CGGCCAGTCA T-CCTTG 

CGGCCAGTCA T-CCTGG 

CGGCCAGTCA T-CCTGG 

CGGCCAGTCA T-CCTGG 

CGGCCAGTCA T-CCTGG 



1750 

-CA ATGGACCACT 
-CT ATGGACCACT 
-CT ATGGACCACT 
-CT ATGGACCACT 
-CT ATGGACCACT 



ccct ggcgt cctgctcacc cggccagtca t cctgg ct atggaccact 



1751 

GT--GGAGAG CCCA-GCCCT 



GACAGC- 
GACAGC- 



GT--GGGGAG CCCA-GCCCT 

GT--GGGGAG CCCA-GCCCT -GACAGC 

GT--GGGGAG CCCA-GCCCT -GACAGC- 

GT--GGGGAG CCCA-GCCCT -GACAGC- 



-T GGAGTC-TGC GCCT- 

■T GGAGCC-TGC GCCT- 

•T GGAGCC-TGC GCCT- 

•T GGAGCC-TGC GCCT- 

T GGAGCC-TGC GCCT- 



gt ggggag ccca gccct gacagc t ggagcc tgc gcct 



1800 

- -CAA 
--CAA 
--CAA 

- -CAA 
-CAA 

caa 



1801 

AAAGCAG-TC 
AAAGCAG-TC 
AAAGCAG-TC 
AAAGCAG-TC 
AAAGCAG-TC 



CTGC-GAGGG CAGTTGGG- 

GTGC-GAGGG CAGCTGGG- 

GTGC-GAGGG CAGCTGGG- 

GTGC-GAGGG CAGCTGGG- 

GTGC-GAGGG CAGCTGGG- 



-AGGATGTGC 
-AGGATGTGC 
-AGGGTGTGC 
-AGGATGTGC 
-AGGATGTGC 



1850 
-TGCACCT-T 
-TGCACCT-G 
-TGCACCT-G 
- TGCACCT-G 
-TGCACCT-G 



aaagcag tc tgc gaggg cagctggg aggatgtgc tgcacct g 

11^^ 1900 

GGTGAGGAGT CACCTTCCCA CCTCTACTAC TGCCAGCTGG AGGCCGGGGC 

GGCGAGGAGG CGCCCTCCCA CCTCTACTAC TGCCAGCTGG AGGCCAGTGC 

GGCGAGGAGG CGCCCTCCCA CCTCTACTAA NTAAANCCCN AA-TTNTTGC 

GGCGAGGAGG CGCCCTCCCA CCTCTACTAC 

GGCGAGGAGG CGCCCTCCCA CCTCTACTAA G 

ggcgaggag cgccctccca cctctacta 

1901 1950 

CTGCTATGTC TTCACGGAGC AGCTGGGCCG CTTTGCCCTG GTAGGAGAGG 

CTGCTACGTC TTCACCGAGC AGCTGGGCCG CTTTGCCCTG GTGGGAGAGG 

AAAAATCCNT TTAAAATTGT NG--GNCCCN TTNAAACCTN 
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ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



-^^^^ 2000 

CCCTCAGCGT GGCTGCCACC AAGCGCCTCA GGCTCCTTCT GTTTGCTCCC 

CCCTCAGCGT GGCTGCCGCC AAGCGCCTCA AGCTGCTTCT GTTTGCGCCG 

CCCTTAAAAA GGGGCCCAAT TTCCNCCTMT NNGGNAWCCN - -TTNAAAAN 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2001 2050 

GTGGCCTGTA CGTCCCTTGA GTACAACATC CGAGTGTACT GCCTACACGA 

GTGGCCTGCA CCTCCCTCGA GTACAACATC CGGGTCTACT GCCTGCATGA 

NTAACTGGCC CCTNTTTTNA AAACNNNCGA NCNGGGNAAA NCC 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2051 2100 
CACCCACGAC GCTCTCAAGG AGGTGGTGCA GCTGGAGAAG CAGCTAGGTG 
CACCCACGAT GCACTCAAGG AGGTGGTGCA GCTGGAGAAG CAGCTGGGGG 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2101 2150 
GACAGCTGAT CCAGGAGCCT CGCGTCCTGC ACTTCAAAGA CAGTTACCAC 
GACAGCTGAT CCAGGAGCCA CGGGTCCTGC ACTTCAAGGA CAGTTACCAC 



ratuncShl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2^5^ 2200 
AACCTACGTC TCTCCATCCA CGACGTGCCC AGCTCCCTGT GGAAGAGCAA 
AACCTGCGCC TATCCATCCA CGATGTGCCC AGCTCCCTGT GGAAGAGTAA 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2201 2250 
GCTACTTGTC AGCTACCAGG AGATCCCTTT TTACCACATC TGGAACGGCA 
GCTCCTTGTC AGCTACCAGG AGATCCCCTT TTATCACATC TGGAATGGCA 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2300 

CCCAGCAGTA TCTGCACTGC ACCTTCACCC TGGAGCGCAT CAACGCCAGC 
CGCAGCGGTA CTTGCACTGC ACCTTCACCC TGGAGCGTGT CAGCCCCAGC 
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ratuncBhl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2301 2350 
ACCAGCGACC TGGCCTGCAA GGTGTGGGTG TGGCAGGTGG AGGGAGATGG 
ACTAGTGACC TGGCCTGCAA GGTGTGGGTG TGGCAGGTGG AGGGCGACGG 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2351 



2400 



GCAGAGCTTC AACATCAACT TCAACATCAC TAAGGACACA AGGTTTGCTG 
GCAGAGCTTC AGCATCAACT TCAACATCAC CAAGGACACA AGGTTTGCTG 



ratunc5hl 
yrn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2401 2450 
AATTGTTGGC TCTGGAGAGT GAAGGGGGGG TCCCAGCCCT GGTQQOCCCQ 
AGCTGCTGGC TCTGGAGAGT GAAGCGGGGG TCCAAGCCCT GGTGGGCCCC 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2451 2500 
AGTGCCTTCA AGATCCCCTT CCTCATTCGG CAAAAGATCA TCGCCAGTCT 
AGTGCCTTCA AGATCCCCTT CCTCATTCGG CAGAAGATAA TTTCCAGCCT 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2501 2550 
GGACCCACCC TGCAGCCGGG GCGCCGACTG GAGAACTCTA GCCCAGAAAC 
GGACCCACCC TGTAGGCGGG GTGCCGACTG GCGGACTCTG GCCCAGAAAC 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2551 2600 
TTCACCTGGA CAGCCATCTT AGCTTCTTTG CCTCCAAGCC CAGCCCTACA 
TCCACCTGGA CAGCCATCTC AGCTTCTTTG CCTCCAAGCC CAGCCCCACA 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2601 26S0 
GCCATGATCC TCAACCTATG GGAGGCACGG CACTTCCCCA ACGGCAACCT 
GCCATGATCC TCAACCTGTG GGAGGCGCGG CACTTCCCCA ACGGCAACCT 
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racuncShl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
>Tn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
yTn97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
IG 
IJrc 
2Brc 
3D 

Consensus 



2651 2700 
CGGCCAGCTG GCAGCAGCTG TGGCCGGACT GGGCCAACCA GATGCTGGCC 
CAGCCAGCTG GCTGCAGCAG TGGCTGGACT GGGCCAGCCA GACGCTGGCC 



2701 2750 
TCTTCACGGT GTCGGAGGCC GAGTGTTGA 

TCTTCACAGT GTCGGAGGCT GAGTGCTGAG GCCGGCCAGG CCCGACACCT 



2751 2800 
ACACTCTCAC CAGCTTTGGC ACCCACCAAG GACAGGCAGA AGCCGGACAG 



2801 2850 
GGGCCCTTCC CCACACCGGG GAGAGCTGCT CGGACAGGCC CCCTCCCGGC 



2851 2900 
CGAAGCTGTC CCTTAATGCT GGTCCTTCAG ACCCTGCCCC CTCGTGCCGA 



2901 

ATTCTGGC 
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gi|2Q5715 (M96376) neurexin Il-alpha-b [Rattus norvegicus] 31 7.4 

gi 1 205715 (M96376) neurexin Il-alpha-b [Rattus norvegicus] 
Length = 1728 

Score =31.3 bits (69), Expect =7.4 

Identities = 16/38 (42%), Positives = 20/38 (52%) 

Query: 337 KACSVCXAGRRALMGKLLEEQGXGVGGRGKANADI YYR 224 

KAC VC + GK LEE+G G G G+ lY + 

Sbjct: 1690 KACCVCRCRATCIAGKPLEERGGG-RGEGERQMQI YIK 172 6 



gi I 1644455 (U72520) mena protein [Mus musculus] 
Length = 541 

Score = 34,0 bits (76), Expect = 0.77 

Identities = 14/23 (60%), Positives = 15/23 (64%) 

Frame = +1 

Query: 31 PPPPCTCPAGRHRVSALPPPAGP 99 

PPPP P+G SALPPP GP 

Sbjct: 284 PPPPPPLPSGPAYASALPPPPGP 306 
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HAEHTOFE 
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HAmroPE 
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SEQUENCE LISTING 

<110> DEVGEN NV 

<120> UNC-5 constructs and screening methods 

<130> SCB/52877/002 

<140> 
<141> 

<160> 96 

<170> Patentin Ver, 2.0 



<210> 1 

<211> 1393 

<212> DNA 

<213> Homo sapiens 



<400> 1 

ttgtttgtgt 

ctcaatgggg 

cccccagacc 

gtctcagaca 

atcaaagtgt 

rccaagctgt 

aaccagagtc 

ctgggaggtc 

attccccaag 

ccacccatgg 

gctctgctca 

gactggaaaa 

ccaggatgcc 

agaacctaag 

tatcgcccat 

ccatgtttgg 

cctgaacaca 

gatctrccag 

ggatcctgcg 

tatccggcag 

gatgctggcc 

cccaactggc 

catgctggca 

agaagggcag 



atcggaagaa 

gctttcagcc 

tcacgtcagc 

aaatcccaat 

acaacacctc 

cccctcagat 

tagcaaggca 

accttattgt 

ggagagtcta 

atgactctca 

cccgccccgt 

tactgctcct 

ctgaaggaaa 

gctcttcatt 

tccctctgga 

agtggatctc 

gtggagctgg 

ctcaactgca 

aacaccatca 

aagctctgra 

cataagctga 

gtaatcctgg 

gctgtcttgg 

tat 



tcatcgtgac 

tgtgaacatc 

tgcagccatg 

gaccaactct 

aagtgctgtc 

gacccagtcg 

gactgatcca 

tcccaattca 

cgaaatgtat 

gacacttttg 

cgtcctcact 

cgctggagta 

ttttacatct 

ttaaaggcag 

agagcaaatt 

aaagaaacct 

tttgcaaact 

ccgtgtcaga 

ccacggtcac 

gcagcctgga 

acctggacag 

atctttggga 

aagaaatggg 



tttgagtcag 

aaggcagcaa 

tacagaggac 

ccaattctgg 

accccccaag 

ttgttggaga 

tcctgtaccg 

ggagtcagct 

gtgactgtac 

acccctgtgg 

atgcatcact 

cagcatccga 

tgagagacag 

cacccacaac 

gctggctaaa 

gcactgcacc 

ctgtgtgcgg 

ggaacctact 

ggggcccagt 

tgccccccag 

gtacttgaat 

agcacagaac 

aagacatgaa 



atattattga 

gacaagat ct 

ctgtctargc 

atccactgcc 

atgacctctc 

atgaagccct 

catttggcag 

tgctgattcc 

acaggaaaga 

tgagctgtgg 

gcgcagaccc 

gtctactgtc 

acgggaggac 

ctgcgcctgt 

tatcaggaaa 

tucactctgg 

caggtggaag 

ggcatcgatt 

gctttcagca 

acgagaggcc 

tactttgcca 

ttcccagatg 

acggtggtgt 



ctctucggca 60 
gctggctgta 120 
cctgcatgac 180 
caacctgaaa 24 0 
tgagtttacg 300 
cagcctgaag 360 
cttcaactcg 420 
cgctggggcc 480 
aactatgagg 540 
gcccccagga 600 
caataccgag 660 
tggargacac 720 
agctcctaga 78 0 
caattcacga 840 
ttccatttta 900 
aaagatttag 960 
gagaagggca 1020 
tgccgctgct 1080 
tcccuctccc 1140 
atgactggag 1200 
ccaaarccag 12 60 
gaaacctgag 132 0 
ccttagcagc 1380 
1393 



<210> 2 
<211> 238 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Leu Phe Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp lie He 
1 5 10 



15 



Asp Ser Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala 
20 25 30 
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Ala Ara Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala 
35 40 45 

Ala Mer Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys 
50 55 60 

lie Pro Met Thr Asn Ser Pro lie Leu Asp Pro Leu Pro Asn Leu Lys 
65 70 75 80 

lie Lys Val Tyr Asn Thr Ser Ser Ala Val Thr Pro Gin Asp Asp Leu 

85 90 95 

Ser Glu Phe Thr Ser Lvs Leu Ser Pro Gin Met: Thr Gin Ser Leu Leu 
100 " 105 110 

Glu Asn Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr 
115 120 125 

Asp Pro Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His 
130 135 140 

Leu lie Val Pro Asn Ser Gly Val Ser Leu Leu lie Pro Ala Gly Ala 
145 150 155 160 

lie Pro Gin Gly Ara Val Tyr Glu Met Tyr Val Thr Val His Arg Lys 

165 170 175 

Glu Thr Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro 
180 185 190 

Val Val Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val 
195 200 205 

Leu Thr Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys lie 
210 215 220 

Leu Leu Leu Ala Gly Val Gin His Pro Ser Leu Leu Ser Gly 
225 230 235 



<210> 3 
<211> 1438 
<212> DNA 

<213> Homo sapiens 
<400> 3 

ttgtttgtgt atcggaagaa tcatcgtgac tttgagrcag atattattga ctctccggca 60 
ctcaatgggg gctttcagcc tgtgaacatc aaggcagcaa gacaagatct gctggctgta 120 
cccccagacc tcacgtcagc tgcagccatg tacagaggac ctgtctatgc cctgcatgac 180 
gtctcagaca aaatcccaat gaccaactct ccaattctgg atccactgcc caacctgaaa 240 
atcaaagtgt acaacacctc aggtgctgtc acctattgtt cccaattcag gagtcagctt 300 
gctgattccc gctggggcca ttccccaagg gagagtctac gaaatgtatg tgac^gtaca 360 
caggaaagaa actatgaggc cacccatgga tgactctcag acacttttga cccctgtggt 420 
gagctgtggg cccccaggag ctctgctcac ccgccccgtc gtcctcacta tgcatcactg 480 
cgcagacccc aataccgagg actggaaaat actgctcaag aaccaggcag cacagggaca 540 
gtgggaggat gtggtggtgg tcggggagga aaacttcacc accccctgct acattcagct 600 
ggatgcagag gcctgccaca tcctcacaga gaacctcagc acctacgccc tggtaggaca 660 
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3 

ttccaccacc aaagcggctg caaagcgcct caagctggcc atctttgggc ccctgtgctg 720 
ctcctcactq aagtacagca tccgagtcta ctgtctggat gacacccagg gtgccctgaa 780 
agaaatttta catctcgaga gacagacggg aggacagctc ctagaagaac ctaaggctct o^u 
^cattttaaa ggcagcaccc acaacctgcg cctgtcaatt cacgataccg cccgttccct 900 
ctggaagagc aaattgctgg ctaaatatca ggaaattcca ttttaccarg ttcggagtgg 960 
atctcaaaaa aacctgcact gcacctccac tctggaaaga tttagccrga acacagtgga 1020 
^r-tggtzrcz aaaczctazg tgcggcaggt ggaaggagaa gggcagatct tccagctcaa 1080 
crgcacca-lc ^.cagagaaac ctaccggcat cgatttgccg ctgctggatc ctgcgaacac 1140 
-atcac-a-g g-cacgggac ccagtgcttt cagcatccct ctccctaccc ggcagaagct 1200 
ctgtagcaac ccagacgccc cccagacgag aggccatgac tggaggatgc tggcccataa 1260 
actgaacctg aaiaggtact tgaattactt tgccaccaaa tccagcccaa ctggcgtaat 1320 
cctagatccu raggaagcac agaacttccc agatggaaac ctgagcatgc tggcagctgt 1380 
cttggaagaa atgggaagac atgaaacggt ggtgtcctta gcagcagaag ggcagtat 1438 

<210> 4 
<211> 130 
<212> PRT 

<213> Hoir.o sapiens 
<400> 4 

Teu Phe Vpl P-a Lvs Asn His Arg Asp Phe Glu Ser Asp He He 

1 ' ' 10 15 

;^sp Ser Se- Ala Leu Asr. Gly Gly Phe Gin Pro Val Asn He Lys Ala 
20 25 30 

Ala Arg Gin Asd Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala 
35 " 40 4S 

Ala Met Tyr Arg Giv Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys 
50 ' 55 60 

Tie Pro Met Tnr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys 
'65 70 -75 80 

Tie Lvs val Tvr Asn Thr Ser Gly Ala Val Thr Tyr Cys Ser Gin Phe 

es 90 

Arg Ser Gin Leu Ala Asp Ser Arg Trp Gly His Ser Pro Arg Glu Ser 
100 105 110 

Leu Arg Asn Val Cvs Asp Cys Thr Gin Glu Arg Asn Tyr Glu Ala Thr 
115 " 120 125 

His Gly 
130 



<210> 5 

<211> 1575 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ttgtttgtgt atcggaagaa tcatcgtgac 
ctcaatgggg gctttcagcc tgtgaacatc 
gcagccatgt acagaggacc tgtctatgcc 
accaactctc caattctgga tccactgccc 



tttgagtcag atattattga ctcttcggca 60 
aaggcagcaa gacaagacct cacgtcagct 120 
ctgcatgacg tctcagacaa aatcccaatg 180 
aacctgaaaa tcaaagtgta caacacctca 240 
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4 

ggtgctgrca ccccccaaga tgacctctct gagtttacgt ccaagcrgrc ccctcagatg 300 
acccaqtcgt tgttggagaa tgaagccctc agcctgaaga accagagtct agcaaggcag 360 
ac-gatccar cctgtaccgc atttggcagc ttcaactcgc tgggaggtca ccttattgtt 420 
cccaazzcaq gagrcagctt gctgatt-ccc gctggggcca ttccccaagg gagagtctac 480 
caaaiciatg tgactgtaca caggaaagaa actatgaggc cacccatgga tgactctcag 540 
-r-rrna cc.cctgtggt gagccgtggg cccccaggag ctctgctcac ccgccccgcc 600 
'zziizza rgcatcactg cgcagacccc aataccgagg actggaaaat actgctcaag 660 
:^-:::zac cacaqgqaca gtgggaggar grggtggrgg ccggggagga aaacttcacc 720 
acacraagct ggaugcagag gccrgccaca tcctcacaga gaacctcagc 780 
tggtaggaca ttccaccacc aaagcggctg caaagcgcct caagcrggcc 840 
ccctgtgctg ctcctcgctg gagtacagca tccgagtcta ctgtctggat 900 
argccctgaa ggaaatttra catcttgaga gannnnnnnn nnnnnnnnnn 960 
-r.y.r.r.r. nnnaggtttt tcatttaaag cangcanccn ncaaatgngc ctgtcaattc 1020 
.T,.':—^ ccaztccctc tgaaagagca aattgctggc taaatatcag gaaattccat 1080 
..*::*- irggagtgga tctcaaagaa ncctgcacLg cacnttcact ctggaaagat 1140 
cacagtggag ctggtttgca aactcrgtgt cggcaggtgg aaggagaagg 1200 
ZT-zji- agcucaactg caccgtgtca gaggaaccta ctggcatcga tttgccgccg 1260 
cqaacaccat caccacggtc acggggccca gtgctttcag catccctctc 1320 
t;^:::. agaagctctg tagcagcctg gatgcccccc agacgagagg ccatgactgg 1380 
,* cccataagct gaacctggac gggtacttga attactttgc caccaaatcc 1440 

: 7.zczaazcct ggatctttgg gaagcacaga acttcccaga tggaaacctg 1500 
.:,-:r-,.7:- cagccgtctt ggaagaaatg ggaagacatg aaacggtggt gtccrtagca 1560 
\^az2::j aacat 1575 



Z 1 1 > 5 2 o 

•:::2> PRT 

v21j. > Hor.':. sapiens 
< 4 0 0 > £ 

Leu ?ho Va : Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp lie lie 
1 5 10 15 

Asp Scr Sc-r Ala Leu Asn Gly Gly Phe Gin Pro Val Asn lie Lys Ala 
20 ^ 25 30 

Ala Arg Glr Asp Leu Thr Ser Ala Ala Ala Met Tyr Arg Gly Pro Val 
35 ^ 40 45 

Tyr Ala Leu His Asp Val Ser Asp Lys lie Pro Met Thr Asn Ser Pro 
50 55 60 

lie Leu Asp Pro Leu Pro Asn Leu Lys lie Lys Val Tyr Asn Thr Ser 
65 '70 75 ^ 80 

Gly Ala Val Ser Pro Gin Asp Asp Leu Ser Glu Phe Thr Ser Lys Leu 

85 90 95 

Ser Pro Gin Met Thr Gin Ser Leu Leu Glu Asn Glu Ala Leu Ser Leu 
100 105 110 

Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp Pro Ser Cys Thr Ala Phe 
115 120 125 

Gly Ser Phe Asn Ser Leu Gly Gly His Leu lie Val Pro Asn Ser Gly 
130 135 140 

Val Ser Leu Leu lie Pro Ala Gly Ala He Pro Gin Gly Arg Val Tyr 
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145 



150 155 160 



Glu Me- Tvr Val Thr Val His Arg Lys Glu Thr Met Arg Pro Pro Met 

165 1^0 175 

Asp Aso Ser Gin Thr Leu Leu Thr Pro Val Val Ser Cys Gly Pro Pro 
180 1S5 190 

G-v Ala Leu Leu Thr Arg Pro Val Val Leu Thr Met His His Cys Ala 
195 200 205 

Asp Pro Asn Thr Glu Asp Trp Lys He Leu Leu Lys Asn Gin Ala Ala 
210 215 220 

Gin Glv Gin Trp Glu Asp Val Val Val Val Gly Glu Glu Asn Phe Thr 
225 ' 230 235 240 

Thr Pm Cys Tvr He Lys Leu Asp Ala Glu Ala Cys His He Leu Thr 

245 250 255 

Glu A-n Leu Ser Thr Tyr Ala Leu Val Gly His Ser Thr Thr Lys Ala 
260 265 270 

Ala Ala Lys Arg Leu Lys Leu Ala He Phe Gly Pro Leu Cys Cys Ser 
275 280 285 

Ser Leu Glu Tyr Ser He Arg Val Tyr Cys Leu Asp Asp Thr Gin Asp 
290 295 300 

Ala Leu Lys Glu He Leu His Leu Glu Arg Gin Thr Gly Gly Gin Leu 
305 310 315 320 

Leu Glu Glu Pro Lys Ala Leu His Phe Lys Gly Ser Thr His Asn Leu 

325 330 335 

Arg Leu Ser He His Asp He Ala His Ser Leu Trp Lys Ser Lys Leu 
340 345 350 

Leu Ala Lys Tyr Gin Glu He Pro Phe Tyr His Val Trp Ser Gly Ser 
355 360 365 

Gin Arg Asn Leu His Cys Thr Phe Thr Leu Glu Arg Phe Ser Leu Asn 
370 375 380 

Thr Va^ Glu Leu Val Cvs Lvs Leu Cys Val Arg Gin Val Glu Gly Glu 
385 390 395 400 

Gly Gin He Phe Gin Leu Asn Cys Thr Val Ser Glu Glu Pro Thr Gly 

405 410 415 

Tie Asp Leu Pro Leu Leu Asp Pro Ala Asn Thr He Thr Thr Val Thr 
420 425 430 

Gly Pro Ser Ala Phe Ser He Pro Leu Pro He Arg Gin Lys Leu Cys 
435 440 445 

Ser Ser Leu Asp Ala Pro Gin Thr Arg Gly His Asp Trp Arg Met Leu 
450 455 460 
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Ala His Lys Leu Asn Leu Asp Arg Tyr Leu Asn Tyr Phe Ala Thr Lys 

465 470 475 480 

Ser Ser Pro Thr Gly Vai lie Leu Asp Leu Trp Glu A.la Gin Asn Phe 

485 490 495 

I-ro Asz- Gly Asn Leu Ser Met Leu Ala Ala Val Leu Glu Glu Me" Gly 
500 505 510 

Arg His Glu Thr Val Val Ser Leu Ala Ala Glu Gly Gin Tyr 
515 520 525 



<21C> 7 
<211> 813 
<212> DNA 

<213> Hon^.o s aniens 



<400> 7 

zagtgazczc 

natcgccgtg 

gaagaaggag 

gcccg ucagc 

cctcagcacc 

ccccaagttc 

actgcaccac 

gaactacttc 

cctcgggggc 

ccataccccg 

aggutgcccc 

-ggcgrcctg 

rgacagctgg 

gcaccucggc 



ugtgtacaca 
gccgnntgcc 
gggctggact 
a tcaagccca 
accaccacca 
cagctcacca 
agctctccca 
cgct ccctgc 
cggctgatga 
agggaagat c 
tagcrggctg 
ctcacccggc 
agcctgcgcc 
gaggaggcgc 



ctgctrctgg 
tggrcctgct 
canatgtggc 
gcaaagcaga 
cctaccaggg 
atgggcacct 
ccrctgaggc 
cccgaggcac 
tccctaatac 
targaga tct 
tcagaccctg 
cagtcatcct 
tcaaaaagca 
ccrcccactc 



ccctgaggac 
gctgcttgtc 
tgactcgtcc 
caacccccat: 
cagtctctgt 
gctcagcccc 
cgaggagttc 
cagcaacatg 
aggaatcaag 
acctcacgct 
ctgagtccca 
ggctatggac 
gtcgtgcgag 
act 



gtggccctct 
ctcatcctcg 
attctcacct 
ctgctcacca 
ccccggcagg 
ctgggtggcg 
gtctcccgcc 
acctatggga 
cctcctcatc 
tgcacaagcc 
tcgttagctt 
cactgtgggg 
ggcagctggg 



a tgTigggcct 
ttca utgccg 
caggcttcca 
tccagccgga 
a tgggcccag 
gccgccacac 
tctccaccca 
ccttcaactt 
cccccagatg 
ggaagacgtg 
gtggaccccc 
agcccagccc 
aggatgrgcr 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

813 



<210> 6 
<211> 265 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Gin Pro Leu Cys Thr His Cys Phe Trp Pro Gly Arg Gly Pro Leu Cys 
15 10 15 

Gly Pro Xaa Arg Arg Gly Xaa Xaa Pro Gly Pro Ala Ala Ala Cys Pro 
20 25 30 

His Pro Arg Leu Leu Pro Glu Glu Gly Gly Ala Gly Leu Xaa Cys Gly 
35 40 45 

Leu Vai His Ser His Leu Arg Leu Pro Ala Arg Gin His Gin Ala Gin 
50 55 60 



;in Ser Arg Gin Pro Pro Ser Ala His His Pro Ala Gly Pro Gin His 
65 70 75 80 



His His His His Leu Pro Gly Gin Ser Leu Ser Pro Ala Gly Trp Ala 

85 90 95 
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In Prn nin Val Pro Ala His Gin Trp Ala Pro Ala Gin Pro Pro Gly 
100 105 110 

'o P^^g Pro Pro His Thr Ala Pro Gin Leu Ser His Leu Gly Arg Gly 
115 120 125 

-^-z Leu Pro Pro Leu His Pro Glu Leu Leu Pro Leu Pro Ala Pro 
V-i-. 135 14 0 

.•- Gir Gin His Asp Leu Trp Asp Leu Gin Leu Pro Arg Gly Pro 

• ■ ' " 150 155 160 

. ---so Pro Tyr Arg Asn Gin Ala Ser Ser Ser Pro Gin Met Pro 

' ■■ ■ ' 165 1^0 175 

- • - ~ u Gly Arg Ser Met Arg Ser Thr Ser Arg Leu His Lys Pro 
180 185 190 

- Arg Leu Pro Leu Ala Glv Cys Gin Thr Leu Leu Ser Pro 

■ : c.c 200 205 

• ■ •- Vc:' --^^r L^u Tro Thr Pro Trp Arg Pro Ala His Pro Ala Ser His 

2!:' " " 215 220 

^- .. ^-..^ pj-o Leu Trp Gly Ala Gin Pro Gin Leu Glu Pro Ala 

22b 230 235 240 

-ro Gin Lvs Ala Val Val Arg Gly Gin Leu Gly Gly Cys Ala Ala Pro 

245 250 255 

G"v Arg Gly Gly Ala Leu Pro Leu Thr 
260 265 



<210> 5 
<211> 2 66 



<212> PRT 

<213> Homo sapiens 

<400> 9 , , , 

Ser ASD Leu Cvs Val His Thr Ala Ser Gly Pro Glu Asp Val Ala Leu 
1 " ' 5 10 15 

Val Gly Xaa He Ala Val Ala Xaa Cys Leu Val Leu Leu Leu Leu 
20 25 30 

Val L=>u II Leu Val Tvr Cys Arg Lys Lys Glu Gly Leu Asp Ser Xaa 
35 ^ 40 45 

Val Ala Asp Ser Ser He Leu Thr Ser Gly Phe Gin Pro Val Ser He 
50 55 60 

Lvs Pro Ser Lys Ala Asp Asn Pro His Leu Leu Thr He Gin Pro Asp 
65 70 75 80 

Leu S'-r Thr Thr Thr Thr Thr Tyr Gin Gly Ser Leu Cys Pro Arg Gin 

85 90 95 „ 
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Asp Gly Pro Ser Pro Lys Phe Gin Leu Thr Asn Gly His Leu Leu Ser 
100 105 110 

Pro Leu Glv Gly Gly Arg His Thr Leu His His Ser Ser Pro Thr Ser 
115 120 125 

Glu Ala Glu Glu Phe Val Ser Arg Leu Ser Thr Gin Asn Tyr Phe Arg 
130 135 140 

Ser Leu Pro Arg Gly Thr Ser Asn Met Thr Tyr Gly Thr Phe Asn Phe 
145 150 155 160 

Leu Gly Glv Arg Leu Met lie Pro Asn Thr Gly lie Lys Pro Pro His 
165 170 175 

Pro Pro Arg Cvs His Thr Pro Arg Glu Asp Leu Asp Leu Pro His Ala 
180 185 190 

Cvs Thr Ser Arg Lys Thr Gly Cys Pro Leu Ala Val Arg Pro Cys Val 
195 200 205 

Pro Ser Leu Ala Cys Gly Pro Pro Gly Val Leu Leu Thr Arg Pro Val 
210 215 220 

He Leu Ala Met Asp His Cys Gly Glu Pro Ser Pro Asp Ser Trp Ser 
225 230 235 240 

Leu Arg Leu Lys Lys Gin Ser Cys Glu Gly Ser Trp Glu Asp Val Leu 
245 250 255 

His Leu Gly Glu Glu Ala Pro Ser His Ser 
260 265 



<210> 10 
<211> 266 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Val Thr Ser Val Tyr Thr Leu Leu Leu Ala Leu Arg Thr Trp Pro Ser 
15 10 15 

Met Trp Ala Xaa Ser Pro Trp Pro Xaa Ala Trp Ser Cys Cys Cys Leu 
20 25 30 

Ser Ser Ser Ser Phe He Ala Gly Arg Arg Arg Gly Trp Thr Xaa Met 
35 40 45 

Trp Leu Thr Arg Pro Phe Ser Pro Gin Ala Ser Ser Pro Ser Ala Ser 
50 55 60 

Ser Pro Ala Lys Gin Thr Thr Pro He Cvs Ser Pro Ser Ser Arg Thr 
65 70 75 80 

Ser Ala Pro Pro Pro Pro Pro Thr Arg Ala Val Ser Val Pro Gly Arg 

85 90 95 
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MetGiv Pro Ala Pro Ser Ser Ser Ser Pro Met Gly Thr Cys Ser Ala 
100 lUb 110 

pro T-D Val Ala Ala Ala Thr His Cys Thr Thr Ala Leu Pro Pro Leu 
115 120 125 

z,^^ Ser Pro Ala Ser Pro Pro Arg Thr Thr Ser Ala 

^ 13: '''^ 135 140 

p^o r,-^ Pro Glu Ala Pro Ala Thr Pro Mer Gly Pro Ser Thr Ser Ser 
i45 150 155 160 

Glv ^la Gl'^' Ser Leu He Gin Glu Ser Ser Leu Leu He Pro Pro Asp 
^ ~ ' 165 1^0 ^'^^ 

Ala He ^rg Gly Lvs He Tyr Glu He Tyr Leu Thr Leu Ala Gin 

180 ^ 185 190 

Ala Gl- Ara Arg Glu Val Ala Pro Ser Trp Leu Ser Asp Pro Ala Glu 
195 200 205 

Ser His I^-q Leu Val Asp Pro Leu Ala Ser Cys Ser Pro Gly Gin Ser 
210 215 220 

Ser ^-D Leu ^rp Thr Thr Val Gly Ser Pro Ala Leu Thr Ala Gly Ala 
225 * 230 235 240 

Cvs F^'a Ser Lvs Ser Ser Arg Ala Arg Ala Ala Gly Arg Met Cys Cys 

245 250 255 

Thr Tro Ala Arg Arg Arg Pro Pro Thr His 
260 265 



<210> 11 
<211> 6 9S1 
<212> DNA 

<213> Caenorhabditis eiegans 
<400> 11 

atgtcgrcgg tggatgacga gcatcatcct 
gaaagrgcar ctgactgtga aaaaactgaa 
gcgaaaaaat cattgcgaaa aaccaagaaa 
gcgggrgctc ULtcatctct ttcaaatggc 
gcrgaagtrr ctgattcagt atgtggacaa 
agaggattga gagatgaacg tgaactcgtg 
tcacacrrgg ctcgagtcag ctgcaaagta 
aaaatgctcc tccgactcct cgcagttctt 
ggaaaaatgc gtatccactg tcttgagaac 
caacacgtgc atctcgaaaa cttgggrtca 
acgttgggtc tcatctggac gatcatcctt 
gacgcrgaca atcacgagac acgttcagcg 
aaaactgctg gatatccaaa tgtcaacgtc 
cttgcattca atgcacttat tcataaacat 
caaaaatcga atgctcttta caatttgcaa 
ggatrggcca agttcctcga tgccgaagai: 
attatcactt atgtagttac atattatcat 



tcaattgaag ttgatattcc tgacgtttat 60 
gactgcacct atctaggatc tatrctcaag 120 
grcgctagtc ctggtatcta tccgggctcg 180 
ctagccgarg agaatagatc gccaaccagt 240 
aaat-ctataa attcagttga cttgaggttc 300 
caaaagaaaa catttacaaa atgggtcaac 360 
caagacctct acatggarat gcgagacgga 420 
tccggtgaac gtctgccgaa accgacgccg 480 
gttgagaaag gtttgcaatt cttgcgcaat 540 
cacgatatcg ttgacggaaa ttcacgcctc 600 
cgattccaga tccaagatat cacatttgaa 660 
aaggaggcct tgttactgtg gtgccagatg 720 
aagaacttct caaccagttg gagagatggt 780 
agacctgatc tcgtagacta cgataattta 840 
tctgcttttg ataccgcaga aaatcaattg 900 
gtgaacgttg accaacctga tgagaaatcc 960 
tacttcaaca aactgaaaca agataacatc 1020 
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caaggaaaac gtatcggaaa agtgattaar gagttgatgg agaacgacaa aatgarcaat: 1080 
cgatacgaaa ctctttcatc tgatttgctc gaatggatca atgccaagat ccaacttctc 1140 
aacgaacgtc atttrgagaa taatttggaa ggtgttcaaa gacagctcac cgagctcaac 1200 
gactaccgta crcaggagaa accaccaaag ttcgatgaaa agggagagct cgaagtactt 1260 
ctgttcactc ttcaatcagc tatgcgrgcc aacaatcaga gacctttcgt gccgagagaa 1320 
ggaaaactga ttgctgatan caacrgagca tggcaatccc ttgagaaggc agaacacgaa 1380 
cgagaacttg tcctgaaaga ggaactcart cgtcaagaga aactcgaaca actcgcagcc 1440 
cgactcaaca gaaaggccga gatgcgagag acttggciica cagaaaacca acggcttgtc 1500 
agtcaggaca atttcggaaa cgacttgtct tcggtggaag ctgccaccaa aaagcacgaa 1560 
gccatcgaga cggacatctt tgcttacgag gagcgtgttc aagccgtcgc cgctgT:tgct 1620 
ggagagctrg aagccgagaa ctaccatgat caagccaaaa tcaatgagag aaaggagaat 1680 
gttcttcaac tctggaacta tcttttccaa cttcttttgg ctcgcagagt tcgarcggag 1740 
ntgtcaatgg ccattcaaaa gatcttccac gatatgcttc ttactctcga tctcatggat 1800 
garattaaga gcagactttt aagcgaggat cttggagccc atcttatgga cgtagaagat 1860 
rtgcttcaaa agcacgctct tcttgagtct gatattaata ttattggaga acgagttaac 1920 
aattctattg ctcaagctca acgcttccga aatccagatg gaccagatgg aagtggatac 1980 
aagccagttg aaccaggaac cattgatgag agaagtgatg ttctacaaaa acgttacaag 2040 
gagcttcttg arttggctgc tgaacgtaag cgtcgtctcg aagacaacaa gagactttgc 2100 
cagttctggt gggatgttgc tgagcttgaa catggtatta aggaacaaga gcaagttctt 2160 
tcgtccactg atactggccg tgatattgtc actgtttcgc atttgttggc caaacataag 2220 
aacgctgaga acaaccttcg cgacttggag aagtacctcg atcgtcttga tgttrctgga 2280 
aaagaatrgc aagatgaaag catcccagga agtgacaaca ttcctccaag gctrgccgag 2340 
attcgtgatt acatcaacaa gctcaaggaa ctctctgctt ctcgcaaaga aagactcgcc 2400 
ggtggagtcg aatactacca attcttcacc gacgccgacg atgtcgatcg ctacttgtat 2460 
gatactctcc gcgtcatgtc atccgaagac gttggaaaag acgagggaac tgttcaactt 2520 
crcctcaaga agcacgacga tgtccatgac gagctccaga actttgacca acacatcaag 2580 
gttcttcacg ccaaggccga gtctcttcca caggaagcca gagaacatcc agatatcaga 2640 
caacgattgg acacaactct: caagcagaag gccgaactcg aaaacctttc tcaacttcgc 2700 
aagcagagac ttatcgatgc tctctccctg tacaagttgt actcggatgc cgattcagtc 2760 
gagtcttgga tcgatgagaa gggaaaactt ttggcaacat tggtccctgg acgcgatatc 2820 
gaagaggttg agatcatgaa gcatcgtttc gatactcttg agcaagatat gaagaatcaa 2880 
gaagctaagg ttactaacgt caatgacrtg gctcgtcaat tgcttaatgt ggaacatcca 2940 
aactccgacg acattcttca ccgtcaaaac aagctcaacg ctcgctgggc tcaacttcgc 3000 
gacatggtcg atcaaaagag aaacgagctc gagagagctc accgcctcga aactttccgc 3060 
atcgattgcc aagaaacagt tacttggatc gaggacaaga ctcgtgtact cgaggattcg 3120 
gacgcgtrga ccaacgatcr ttccggrgtc atgaaacttc aaagaagatt gtccatgatg 3180 
gaacgcgatt tgggagcgat tcaagccaag ctcgactcgc ttcacaaaga agccgatgac 3240 
atcgaaagag aacgaccaca agaagctcaa gccattcgtg aagatatcaa gagaarccac 3300 
caagtctggg atatccttaa caagaaagtt cgtgagcacg aagctaaact cgatgaagct 3360 
ggagatcttc aacgctrcct ccgtgatttg gatcacttcc aagcctggct tactgccacc 3420 
cagagacaag trgcctcaga ggaagagcca caatccttgg ctgaagctga gcaacttctc 3480 
aatcaacacg ccgccatccg tgaagagatt gacggatacg ctgaggatta caagaagatg 3540 
cgtgcaatgg gagatcgtgt cactcaagat caaaccgatc cacaatacat gttccttcgt 3600 
caacgtcttg ctggacttca agaaggatgg gaagagttgc agcgaatgtg ggacaacaga 3660 
caacatttgc tttctcaagg actcaacctt caaatgttcc ttcgcgatgc caaacaagca 3720 
gaggttatgc tctctcaaca agaaaactat ctcgccaagg acgatatccc acaatctttg 3780 
gagcaagccg agaaccaact caagagacac caagacttca tcaccactat ggatgccaac 3840 
gacgaaaaga ttcgcgccgt cggaatgttc ggtgatcagc tctgccaaga tggtcactat 3900 
gccgctgaca agacccataa gaaggctaga aacattgatg aacgtcgtgg agccaaccgc 3960 
gagaaggctc aagaagtact caagaagctg aaggacgcct tgtctctcca acaactcttg 4020 
cccgactgcg acgagctgag agaatggatc gaggagaaga tgatccgtgc tcaagatgag 4080 
acttacagag acgccaagac tatcacctcc aaattcgtcc gacatcaagc attccaatcc 4140 
gaacttgctg ctaacaagga acgtcttgac caacttaaac atgctgccat caaccttggt 4200 
gacgacaagc cagagtacca cggaaccatt gatccacaga tcgaggaact cgctactcaa 4260 
rgggatgagt tggaaaagac tacagaggag aagggacaga agttgtttga tgctaaccgc 4320 
caacaacttt acgtacaaag tattgctgac atgaaggaat gggcaaccca acttgagaac 4380 
gagatgactc gtgaagacca accaggagat cttaccactg tcaatgtggc catgcaaaaa 4440 
caacatttga tcgaaactga aatgatcaag aaggctcaac acattgatca actcatggag 4500 
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atggaqccac agctcgagga actccaccca gacgagcttg aaaacatcaa ggctcatcgt 4560 
ctaacaattc aagaacaact tcaaagactc caggctccat tggacgaccg acgtaaggca 4 520 
ttggaacgca agaaggccgc cttccaattt ggtcgcgatg ttgatgacga gaaatcgugg ^oou 
atttctgaga gacttgttct cgctaaagct caaaatcttg gagaaagcct tccagattgt 4,40 
catcgtctcc agaagaacct acaacttctt tccaatgaaa tcgacaacca cgaaccatgg 4800 
atcaaccaaa rctataataa tgqtcaagag ttgattgacg aaggacacgc taacggacca 4860 
acattcqaga agaagattca agaactgcgc agtgcttggc aagagtcgaa ggaagccgtc 4920 
aaagacagaa agggagatct tggagaaagc gagaaggctc atcaattctr gtatgattgc 4980 
ggagaggctg aagcctggat gagcgaacaa gaattgtaca tgatgcaaga cgaaagagga 5040 
aaagacgagt tctcaaccaa gaaccaaatc aagaaacacg aacgccttca atccgatarc 5100 
aacaaatttg ctgatacgat ccgcgctctt gccaccaagg ctcacaagtc cgtcgaagag 5160 
aagtcaccac tcaccgaaca gatccaagtt cgtcaagctc aaatcgagaa gctctacgca 5220 
ggtcttcaag atttgtcgaa ggagcgcaga aagagactcg aagagacctt ggaactttac 5280 
actcttcatc gtgaaatcga cgatcttctt caatggatcg ccgacaagga agttgttgca 5340 
ggaagtcaag agaacggaca agactacgaa cacgtccaaa tgcttcaaga gagattccaa 5400 
cagtttgctc gtgacaccga aaacatcgga tccgaacgtg ttgccaatgc taatgatgga 5460 
rgtgatactc tcattggtca cggacacact gacgctccga ctatagcatt gtggaaggat 5520 
tcgttgaatg aggcctggga gaatcttctg gagttgatgg acactcgtgc ccaaattcrc 5580 
qaggcttcta gacttcttca caagttctac cacgactgcc gtgattgctt gtcaagaatc 5640 
atggaaaaga ctcatgctat gcctgatgat cttggacgtg actcaagcag tgttggtgca 5700 
ctctccagaa aacatcagaa ctatctcaag gatattgctg caattggaga acaagttgct 5760 
caaatcgaaa gagatgccgc agagcttcga gatggttacg ctggagacaa ggctcttgat 5820 
atcggatctc gcgaaagcga ggttgtcaag gcatggaggc atctccgcgg actctgtgac 5880 
gcccgtacat ctcgtctcat ggatacttct gatcttttca agttcatgaa tatggtccgt 5940 
gatttgttgt tgtggatgga tgaggtgaag agagaaatga actcacaaga acgtccgaag 6000 
gatgtgagcg gagtcgagtt gctcatgaac aatcatcaat cgctcaaggc tgaaatcgat 6060 
gctcgcgagg agaacttcaa cgcgtgcatc tcattgggac gagatctgct caacagaaaa 6120 
cactacacat caagtgaaat cgagaagaag ctgatcaagt tgacaaccga aagagccgaa 6180 
atgatgcgca gatgggaaga tcgttgggaa tatcttcagc tcattctgga agtgtaccaa 6240 
ttcgcccgcg acgctgctgt cgccgaatct tggctcttcg cccaagaacc atatcttatu 6300 
tcgaaggaat acggtagaaa cctcgaagaa accatcaaac tcatcaagaa acacgaggcg 6360 
ttcgagaaat cggcattcgc acaagaagaa cgtttcttgg cgctcgagaa attgacaacg 6420 
ttcgaattga aagaaactca acaccgcgag gaggagaccg ccaagcgacg tggtccagct 6480 
catattggaa gtccttctag gtcaactcca gccgcggaaa catctttcgg cgcccaggat 6540 
gatggagcca aacaaggaga agcattcgaa ggaacgctta tcagaaagca cacttacgag 6600 
agtctcgacc ggaaagcggc taaccgctca tgggagaaac tttacgcagt gttgcggcag 6660 
aatgaactgt cattctacaa agatccgaag catagagatg aatctgtgca tggagagcca 6720 
ccaatggcac tgccaggatg ctctgtcaac gtggcatcag actatcaaaa gaagaagaac 6780 
gtactgtcgc tgaggttacc catcggcgcc gagtacctat tccaatgcgg tagcgaagaa 6840 
gacatacagc gatggctcac cgagttgcaa gtcgccacgg ggcaagccca actggaagag 6900 
gctagtagat cacagacgct accagcagaa ggcagtgcca caaagaagaa gggaggattt 6960 
ttctctcgag gcaagaagta a 

<210> 12 
<211> 2326 
<212> PRT 

<213> Caenorhabditis elegans 

<400> 12 . 
Met Ser Ser Val Asp Asp Glu His His Pro Ser lie Glu Val Asp lie 
15 10 15 

Pro Asp Val Tyr Glu Ser Ala Ser Asp Cys Glu Lys Thr Glu Asp Cys 
20 25 30 

Thr Tyr Leu Gly Ser He Leu Lys Ala Lys Lys Ser Leu Arg Lys Thr 
35 40 45 
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Lys Lys Val Ala Ser Pro Gly He Tyr Pro Giy Ser Ala Gly Ala Leu 
50 55 60 

Ser Ser Leu Ser Asn Gly Leu Ala Asp Glu Asn Arg Ser Pro Thr Ser 
65 70 75 80 

Ala Glu Val Ser Asp Ser Val Cys Gly Gin Lvs Ser He Asn Ser Val 

85 SO 95 

Asp Leu Arg Phe Arg Gly Leu Arg Asp Glu A.rg Glu Leu Val Gin Lys 
100 105 110 

Lys Thr Phe Thr Lys Trp Val Asn Ser His Leu Val Arg Val Ser Cys 
115 120 125 

Lys Val Gin Asp Leu Tyr Met Asp Met Arg A.sp Gly Lys Met Leu Leu 
130 ^ 135 140 

Arg Leu Leu Ala Val Leu Ser Gly Glu Arg Leu Pro Lys Pro Thr Pro 
145 150 155 160 

Gly Lys Met Arg lie His Cys Leu Glu Asn Val Glu Lys Gly Leu Gin 
165 170 175 

Phe Leu Arg Asn Gin His Val His Leu Glu Asn Leu Gly Ser His Asp 
180 185 190 

He Val Asp Gly Asn Ser Arg Leu Thr Leu Gly Leu lie Trp Thr He 
195 200 205 

He Leu Arg Phe Gin He Gin Asp He Thr Phe Glu Asp Ala Asp Asn 
210 215 220 

His Glu Thr Arg Ser Ala Lys Glu Ala Leu Leu Leu Trp Cys Gin Met 
225 230 235 240 

Lys Thr Ala Gly Tyr Pro Asn Val Asn Val Lys Asn Phe Ser Thr Ser 
245 250 255 

Trp Arg Asp Gly Leu Ala Phe Asn Ala Leu He His Lys His Arg Pro 
260 265 270 

Asp Leu Val Asp Tyr Asp Asn Leu Gin Lys Ser Asn Ala Leu Tyr Asn 
275 280 285 

Leu Gin Ser Ala Phe Asp Thr Ala Glu Asn Gin Leu Gly Leu Ala Lys 
290 295 300 

Phe Leu Asp Ala Glu Asp Val Asn Val Asp Gin Pro Asp Glu Lys Ser 
305 310 315 320 

He He Thr Tyr Val Val Thr Tyr Tyr His Tyr Phe Asn Lys Leu Lys 
325 330 335 

Gin Asp Asn He Gin Gly Lys Arg He Gly Lys Val He Asn Glu Leu 
340 345 350 



Met Glu Asn Asp Lys Met He Asn Arg Tyr Glu Thr Leu Ser Ser Asp 
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355 350 365 

Leu L-u Glu Trp He Asn Ala Lys He Gin Leu Leu Asn Glu Arg His 
370 375 380 

Phe Glu Asn Asn Leu Glu Gly Val Gin Arg Gin Leu Thr Glu Phe Asn 
385 390 395 400 

&SD T'-r A^^a -^hr Gin Glu Lys Pro Pro Lys Phe Asp Glu Lys Gly Glu 
■* ■ " " 405 410 415 

Teu G'U Val Leu Leu Phe Thr Leu Gin Ser Ala Met Arg Ala Asn Asn 
420 425 430 

Gin A-g Pro Phe Val Pro Arg Glu Gly Lys Leu He Ala Asp He Asn 
435 440 445 

z^rq Trp Gin Ser Leu Glu Lys Ala Glu His Glu Arg Glu Leu Val 

450 " 455 460 

L-u Lvs Glu Glu Leu He Arg Gin Glu Lys Leu Glu Gin Leu Ala Ala 
465 " 470 475 480 

z^rg Pre Asn Arg Lys Ala Glu Met Arg Glu Thr Trp Leu Thr Glu Asn 

485 490 495 

Gin Ara Leu Val Ser Gin Asp Asn Phe Gly Asn Asp Leu Ser Ser Val 
500 505 510 

Glu Ala Ala Thr Lys Lys His Glu Ala He Glu Thr Asp He Phe Ala 
515 520 525 

Tvr Glu Glu Arg Val Gin Ala Val Val Ala Val Ala Gly Glu Leu Glu 
530 535 540 

Ala Glu Asn Tyr His Asp Gin Ala Lys He Asn Glu Arg Lys Glu Asn 
545 550 555 560 

Val Leu Gin Leu Trp Asn Tyr Leu Phe Gin Leu Leu Leu Ala Arg Arg 

565 570 575 

Val Arg Leu Glu Leu Ser Met Ala He Gin Lys He Phe His Asp Met 
580 585 590 

Leu Leu Thr Leu Asp Leu Met Asp Asp He Lys Ser Arg Leu Leu Ser 
595 600 605 

Glu Asp Leu Gly Ala His Leu Met Asp Val Glu Asp Leu Leu Gin Lys 
610 615 620 

His Ala Leu Leu Glu Ser Asp He Asn He He Gly Glu Arg Val Asn 
625 630 635 640 

Asn Ser He Ala Gin Ala Gin Arg Phe Arg Asn Pro Asp Gly Pro Asp 

645 650 655 

Gly Ser Gly Tyr Lys Pro Val Glu Pro Gly Thr He Asp Glu Arg Ser 
660 665 670 
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Asp Val Leu Gin Lys Arg Tyr Lys Glu Leu Leu Asp Leu Ala Ala Glu 
675 680 685 

Arg Lvs Arg Arg Leu Glu Asp Asn Lys Arg Leu Cys Gin Phe Trp Trp 
690 695 700 

Asp Val Ala Glu Leu Glu His Gly lie Lys Glu Gin Glu Gin Val Leu 
705 710 ^715 72 0 

Ser Ser Tnr Asp Thr Gly Arg Asp lie Val Thr Val Ser His Leu Leu 
725 ' 730 735 

Ala Lys His Lvs Asn Ala Glu A.sn Asn Leu Arg Asp Leu Glu Lys Tyr 
740 745 750 

Leu Asp Arg Leu Asd Val Ser Giy Lys Glu Leu Gin Asp Glu Ser lie 
755 ' 760 765 

Pro Gly Ser As'^ Asn lie Pro Pro Arg Leu Ala Glu lie Arg Asp Tyr 
770 ' 775 780 

lie Asn Lys Leu Lvs Glu Leu Ser Ala Ser Arg Lys Glu Arg Leu Ala 
785 ' " 79C 795 800 

Gly Gly Val Glu Tvr Tvr Gin Phe Phe Thr Asp Ala Asp Asp Val Asp 
805 " 810 815 

Arg Tyr Leu Tyr Asp Tnr Leu Arg Val Met Ser Ser Glu Asp Val Giy 
820 825 830 

Lys Asp Glu Giy Thr Val Gin Leu Leu Leu Lys Lys His Asp Asp Val 
835 840 845 

His Asp Glu Leu Gin Asn Phe Asp Gin His lie Lys Val Leu His Ala 
850 855 860 

Lys Ala Glu Ser Leu Pro Gin Glu Ala Arg Glu His Pro Asp lie Arg 
865 870 875 880 

Gin Arg Leu Aso Thr Thr Leu Lys Gin Lys Ala Glu Leu Glu Asn Leu 
885 890 895 

Ser Gin Leu Arq Lvs Gin Arg Leu lie Asp Ala Leu Ser Leu Tyr Lys 
900 ^ 905 910 

Leu Tyr Ser Asp Ala Asp Ser Val Glu Ser Trp lie Asp Glu Lys Gly 
915 920 925 

Lys Leu Leu Ala Thr Leu Val Pro Gly Arg Asp lie Glu Glu Val Glu 
930 935 940 

lie Met Lys His Arc Phe Asp Thr Leu Glu Gin Asp Met Lys Asn Gin 
945 ^ 950 955 ' 960 

Glu Ala Lys Val Thr Asn Val Asn Asp Leu Ala Arg Gin Leu Leu Asn 
965 970 975 
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Val Giu His Pro Asn Ser Asp Asp He Leu His Arg Gin Asn Lys Leu 
980 985 990 

Asn Ala Arg Trp Ala Gin Leu Arg Asp Met Val Asp Gin Lys Arg Asn 
995 1000 1005 

Giu Leu Giu Arg Ala His Arg Leu Giu Thr Phe Arg He Asp Cys Gin 
1010 1015 1020 

Giu Thr Val Thr Trp He Giu Asp Lys Thr Arg Val Leu Giu Asp Ser 
1025 1030 1035 1040 

ASP Ala Leu Thr Asn Asp Leu Ser Gly Val Met Lys Leu Gin Arg Arg 
^ 1045 1050 1055 

Leu ser Met Met Giu Arg Asp Leu Gly Ala He Gin Ala Lys Leu Asp 
1060 1065 1070 

Ser Leu His Lys Giu Ala Asp Asp He Giu Arg Giu Arg Pro Gin Giu 
1075 1080 1085 

Ala Gin Ala He Arg Giu Asp He Lys Arg He His Gin Val Trp Asp 
1090 1095 1100 

He Leu Asn Lys Lys Val Arg Giu His Giu Ala Lys Leu Asp Giu Ala 
1105 1110 1115 1120 

Gly Asp Leu Gin Arg Phe Leu Arg Asp Leu Asp His Phe Gin Ala Trp 
1125 1130 1135 

Leu Thr Ala Thr Gin Arg Gin Val Ala Ser Giu Giu Giu Pro Gin Ser 
1140 1145 1150 

Leu Ala Giu Ala Giu Gin Leu Leu Asn Gin His Ala Ala He Arg Giu 
1155 1160 1165 

Giu He Asp Gly Tyr Ala Giu Asp Tyr Lys Lys Met Arg Ala Met Gly 
1170 1175 1180 

Asp Arg Val Thr Gin Asp Gin Thr Asp Pro Gin Tyr Met Phe Leu Arg 
1185 1190 1195 1200 

Gin Arg Leu Ala Gly Leu Gin Giu Gly Trp Giu Giu Leu Gin Arg Met 
1205 1210 1215 

Trp Asp Asn Arg Gin His Leu Leu Ser Gin Gly Leu Asn Leu Gin Met 
1220 1225 1230 

Phe Leu Arg Asp Ala Lvs Gin Ala Giu Val Met Leu Ser Gin Gin Giu 
1235 1240 1245 

Asn Tyr Leu Ala Lys Asp Asp He Pro Gin Ser Leu Giu Gin Ala Giu 
1250 1255 1260 

Asn Gin Leu Lys Arg His Gin Asp Phe He Thr Thr Met Asp Ala Asn 
1265 1270 1275 1280 

Asp Giu Lys He Arg Ala Val Gly Met Phe Gly Asp Gin Leu Cys Gin 
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1285 1290 1295 

Asp Gly His Tvr Ala Ala Asp Lys lie His Lys Lys Ala Arg Asn lie 
1300 1305 " 1310 

Asp Glu Arg Arg Gly Ala Asn Arg Glu Lys Ala Gin Glu Val Leu Lys 
1315 1320 1325 

Lys Leu Lys Asp Ala Leu 5er Leu Gin Gin Phe Leu Ser Asp Cys Asp 
1330 ' 1335 1340 

Glu Leu Arg Glu Trp lie Glu Glu Lys Met lie Arg Ala Gin Asp Glu 
1345 1350 1355 1360 

Thr Tyr Arg Asp Ala Lys Thr lie Thr Ser Lys Phe Val Arg His Gin 
1365 1370 "^1375 

Ala Phe Gin Ser Glu Leu Ala Ala Asn Lvs Glu Arg Leu Asp Gin Leu 
1380 1385 1390 

Lys His Ala Ala lie Asn Leu Gly Asp Asp Lys Pro Glu Tyr His Gly 
1395 1400 " 1405 

Thr lie Asp Pro Gin lie Giu Glu Leu Ala Thr Gin Trp Asp Glu Leu 
1410 1415 1420 

Giu Lys Thr Thr Glu Giu Lys Gly Gin Lys Leu Phe Asp Ala Asn Arg 
1425 1430 1435 1440 

Gin Gin Leu Tyr Val Gin Ser lie Ala Asp Met Lys Glu Trp Ala Thr 
1445 1450 1455 

Gin Leu Glu Asn Glu Met Thr Arg Giu Asp Gin Pro Gly Asp Leu Thr 
1460 1465 1470 

Thr Val Asn Val Ala Met Gin Lys Gin His Leu lie Glu Thr Giu Met 
1475 1480 1485 

lie Lys Lys Ala Gin His lie Asp Gin Leu Met Giu Met Glu Pro Gin 
1490 1495 1500 

Leu Glu Glu Leu His Pro Asp Glu Leu Giu Asn lie Lys Ala His Arg 
1505 1510 1515 ^ 1520 

Leu Ala Val Gin Giu Gin Leu Gin Arg Leu Gin Ala Pro Leu Asp Asp 
1525 1530 1535 

Arg Arg Lys Ala Leu Giu Arg Lys Lys Ala Ala Phe Gin Phe Gly Arg 
1540 1545 1550 

Asp Val Asp Asp Glu Lys Leu Trp lie Ser Glu Arg Leu Val Leu Ala 
1555 1560 1565 

Lys Ala Gin Asn Leu Gly Glu Ser Leu Pro Asp Cys His Arg Leu Gin 
1570 1575 1580 

Lys Asn Leu Gin Leu Leu Ser Asn Glu lie Asp Asn His Glu Pro Trp 
1585 1590 1595 1600 



wo 00/73328 



PCT/EPOO/0S1O8 



Tie J^s- r,ln Tie Cvs Asn Asn Gly Gin Glu Leu He Asp Glu Gly His 
1605 1610 161^ 

Ala ^sn Gly Pro Ala Phe Glu Lys Lys He Gin Glu Leu Arg Ser Ala 
1620 1625 1630 

T^D G- Glu Leu Lvs Glu Ala Val Lvs Asp Arg Lys Gly Asp Leu Gly 
1535 ' 1640 1645 

Glu Glu Lys Ala His Gin Phe Leu Tyr Asp Cys Gly Glu Ala Glu 

1650 ' 1655 1660 

Ala T-o Me- Se- Glu Gin Glu Leu Tyr Met Met Gin Asp Glu Arg Gly 
1665 " " ' 1670 1675 1680 

Lvs i^s- Gil- ^''e S«- Thr Lvs Asn Gin He Lys Lys His Glu Arg Leu 
" ' 1685 ' 1690 1695 

Gin As- -i- »5p L'.- Phe Ala Asp Thr He Arg Ala Leu Ala Thr 

■ iTOu ' 1705 1710 

Lys A^a Hi-. Lvs Fr- Val Glu Glu Lys Ser Pro Leu Thr Glu Gin He 
1715 ' 1720 1725 

Gin V-1 Arg Gin Ala Gin He Glu Lys Leu Tyr Ala Gly Leu Gin Asp 
1730 1735 1740 

Leu Ser Lys Glu Arg Arg Lys Arg Leu Glu Glu Thr Leu Glu Leu Tyr 
1745 1750 1755 1760 

Ala Leu His Arg Glu He Asp Asp Leu Leu Gin Trp He Ala Asp Lys 
1765 1770 1775 

Glu val Val Ala Giv Ser Gin Glu Asn Gly Gin Asp Tyr Glu His Val 
1780 ' 1785 1790 

Gin Met Leu Gin Glu Arg Phe Gin Gin Phe Ala Arg Asp Thr Glu Asn 
1795 1800 1805 

He Gly Ser Glu Arg Val Ala Asn Ala Asn Asp Gly Cys Asp Thr Leu 
1810 1815 1820 

He GW His Glv His Thr Asp Ala Pro Thr He Ala Leu Trp Lys Asp 
1825 ' ' 183C 1835 1840 

Ser L=u Asn Glu Ala Trp Glu Asn Leu Leu Glu Leu Met Asp Thr Arg 
1845 1850 1855 

Ala Gin He Leu Glu Ala Ser Arg Leu Leu His Lys Phe Tyr His Asp 
I860 1865 1870 

Cys ;^-g ASP Cys Leu Ser Arg He Met Glu Lys Thr His Ala Met Pro 
1875 1880 1885 

Asp Asp Leu Gly Arg Asp Ser Ser Ser Val Gly Ala Leu Ser Arg Lys 
1390 1895 1900 
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His Gin Asn Tyr Leu Lys Asp lie Ala Ala lie Giy Glu Gin Val Ala 
1905 1910 1915 1920 

Gin He Glu Arg Asd Ala Ala Glu Leu Arg Asp Gly Tyr Ala Gly Asp 
1925 1930 1935 

Lvs Ala Leu A.sp He Gly Ser A.rg Glu Ser Glu Val Val Lys Ala Trp 
1940 1945 1950 

Arg His Leu Arg Gly Leu Cys Asp Ala Arg Thr Ser Arg Leu Met Asp 
1955 1960 1965 

Tnr Ser Asp Leu Phe Lys Phe Met Asn Met Val Arg Asp Leu Leu Leu 
197C ^ 1975 1980 

Trp Met Asd Glu Val Lys Arg Glu Met Asn Ser Gin Glu Arg Pro Lys 
1985 ^ 1990 1995 2000 

Aso Val Ser Gly Val Glu Leu Leu Met Asn Asn His Gin Ser Leu Lys 
2005 2010 2015 

Ala Glu He Asd Ala Arg Glu Glu Asn Phe Asn Ala Cys He Ser Leu 
2020 2025 2030 



Gly Arg Asp Leu Leu Asn Arg Lys 

2035 2040 

Lys Lys Leu He Lys Leu Thr Thr 

2050 " 2055 

Trp Glu Asp Arg Trp Glu Tyr Leu 

2065 2070 

Phe Ala Arg Asp Ala Ala Val Ala 
' 2085 



His Tyr Ala Ser Ser Glu He Glu 
2045 

Glu Arg Ala Glu Met Met Arg Arg 
2060 

Gin Leu He Leu Glu Val Tyr Gin 

2075 2080 

Glu Ser Trp Leu Phe Ala Gin Glu 
2090 2095 



Pro Tyr Leu He Ser Lys Glu Tyr Gly Arg Asn Leu Glu Glu Thr He 
2100 2105 2110 

Lys Leu He Lys Lys His Glu Ala Phe Glu Lys Ser Ala Phe Ala Gin 
2115 2120 2125 

Glu Glu Arg Phe Leu Ala Leu Glu Lys Leu Thr Thr Phe Glu Leu Lys 
2130 2135 2140 

Glu Thr Gin His Arg Glu Glu Glu Thr Ala Lys Arg Arg Gly Pro Ala 
2145 2150 2155 2160 

His He Gly Ser Pro Ser Arg Ser Thr Pro Ala Ala Glu Thr Ser Phe 
2165 2170 2175 

Gly Ala Gin Asp Asd Gly Ala Lys Gin Gly Glu Ala Phe Glu Gly Thr 
2180 2185 2190 

Leu He Arg Lys His Thr Tvr Glu Ser Leu Asp Arg Lys Ala Ala Asn 
2195 ^ 2200 2205 



Arg Ser Trp Glu Lys Leu Tyr Ala Val Leu Arg Gin Asn Glu Leu Ser 
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221C 2215 2220 

-^v- Lvs Asp Pro Lys His Arg Asp Glu Ser Val Hxs Gly Glu Pro 
225^"^ 2230 2235 2240 

Ala Leu Pro Glv Cys Ser Val Asn Val Ala Ser Asp Tyr Gin 
2245 ^ 2250 2255 

L^'S Asn Val Leu Ser Leu Arg Leu Pro lie Gly Ala Glu Tyr 
' ' 2260 2265 2270 

: ^n-^ Gin Cys Gly Ser Glu Glu Asp Met Gin Arg Trp Leu Thr Glu 
22'75 2280 2285 

- ' Ala Thr Glv Gin Ala Gin Leu Glu Glu Ala Ser Arg Ser 

^ 2295 2300 

' Pro Ala Glu Glv Ser Ala Thr Lys Lys Lys Gly Gly Phe 

. 2310 ^ 2315 2320 

L^o: Ar:: Gly Lys Lys 
^ 2325 



2192 

Cae no rhabditis eiegans 



<4 00> 13 
naacct-QCLg 
racgacaag 3 
-gggacgagt 
oaacaact t t 
gaga tgacLC 
caacat":: Tiga 
arggacccac 
c tggcagct c 
z tggaacgca 
a tztcrgaga 
oa tcgtcrcc 
atcaaccaga 
gcattcgaga 
aaagacagaa 
ggagaggctc 
aaagacgag t 
gacaaattcg 
aagt caccac 
ggtcfccaag 
gctcttcatc 
ggaagtcaag 
cagttrgctc 
tgtgatactc 
- cgt tgaatg 
gaggcttcta 
atggaaaaga 
ctctccagaa 
caaatcgaaa 



craacaagga 

cagaguacca 

tggaaaagac 

acgtacaaag 

gtgaagacca 

tcgaaacrga 

agctcgagga 

aagaacaact 

agaaggccgc 

gacttgttct 

agaagaacct 

tctgtaataa 

agaagattca 

agggagatct 

aagcctggat 

rctcaaccaa 

CTigatacgat 

tcaccgaaca 

atttgrcgaa 

gtgaaarcga 

agaacggaca 

gtgacaccga 

tcattggtca 

aggcctggga 

gacttcttca 

ctcatgctat 

aacatcagaa 

gagatgccgc 



acgtcttgac 

cggaaccat t 

tacagaggag 

tatrgctgac 

accaggagat 

aatgatcaag 

actt caccca 

tcaaagactc 

cttccaattt 

cgctaaagcr 

acaacrtct t 

tggrcaagag 

agaactgcgc 

tggagaaagc 

gagcgaacaa 

gaaccaaa tc 

ccgrgctctt 

gatccaagtt 

ggagcgcaga 

cgatcttctt 

agactacgaa 

aaacatcgga 

cggacacact 

gaatcttctg 

caagttctac 

gcctgatgat 

ctatctcaag 

agagcttcga 



caacttaaac 

gatccacaga 

aagggacaga 

atgaaggaat 

crtaccactg 

aaggctcaac 

gacgagcttg 

caggctccat 

ggtcgcgarg 

caaaatcttg 

tccaatgaaa 

ttgattgacg 

agtgcttggc 

gagaaggctc 

gaattgtaca 

aagaaacacg 

gccaccaagg 

cgtcaagctc 

aagagactcg 

caatggatcg 

cacgtccaaa 

tccgaacgtg 

gacgctccga 

gagttgatgg 

cacgactgcc 

cttggacgtg 

gatattgctg 

gatggttacg 



atgctgccat 

tcgaggaact 

agttgtttga 

gggcaaccca 

tcaatgtggc 

acattgatca 

aaaacattaa 

tggacgaccg 

ttgargacga 

gagaaagcct 

tcgacaacca 

aaggacatgc 

aagagttgaa 

atcaa ttctt 

tgatgcaaga 

aacgccttca 

ctcacaagtt 

aaatcgagaa 

aagagacctt 

ccgacaagga 

tgcttcaaga 

ttgccaatgc 

ctatagcatt 

acactcgtgc 

gtgattgctt 

actcaagcag 

caatcggaga 

ctggagacaa 



caaccttggt 60 
cgctactcaa 120 
tgctaaccgc 180 
acttgagaac 240 
catgcaaaaa 300 
actcatggag 360 
ggctcatcgt 420 
acgtaaggca 480 
gaaattgtgg 540 
tccagattgt 600 
cgaaccatgg 660 
taacggacca 720 
ggaagccgtc 780 
gtatgattgc 840 
cgaaagagga 900 
atccgatatt 960 
cgtcgaagag 1020 
gctctacgca 1080 
ggaactttac 1140 
agttgttgca 1200 
gagattccaa 1260 
taatgatgga 1320 
gtggaaggat 1380 
ccaaattctc 1440 
gtcaagaatc 1500 
tgttggtgca 1560 
acaagttgct 1620 
ggctcttgat 1680 
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atcggatctc gcgaaagcga ggttgtcaag gcatggaggc atcuccgcgg acrctgtgac 1740 

gcccgtacat ctcgtctcat ggatacttct gatcttttca agttcatgaa tatggtccgt 1800 

gatttgttgt tgtggatgga tgaggtgaag agagaaatga actcacaaga acgtccgaag 1860 

gatgtgagcg gagtcgagtt gctcatgaac aatcatcaat cgctcaaggc tgaaatcgat 1920 

gctcgcgagg agaacttcaa cgcgtgcatc tcattgggac gagatctgct caacagaaaa 1980 

cactacgcat caagtgaaat cgagaagaag ctqatcaagt tgacaaccga aagagccgaa 2040 

atgatgcgcd gargggaaga tcgttgggaa tatcrtcagc t:cattcT:gga agtgtaccaa 2100 

trcgcccgcg acgctgctgt: cgccgaatct tiggctcttcg cccaagaacc aratcttart 2160 

tcgaaggaat acggtagaaa cctcgaagaa ac 2192 

<210> 14 
<211> 731 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 14 

Glu Leu Ala Ala Asn Lys Glu Arg Leu Asp Gin Leu Lys His Ala Ala 
1 5 " 10 15 

lie Asn Leu Giy Asp Asp Lys Pro Glu Tyr His Gly Thr lie Asp Pro 
20 25 ' 30 

Gin lie Glu Glu Leu Ala Thr Gin Trp Asp Glu Leu Glu Lys Thr Thr 
35 40 45 

Glu Glu Lys Gly Gin Lys Leu Phe Asp Ala Asn Arg Gin Gin Leu Tyr 
50 55 60 

Val Gin Ser lie Ala Asp Met Lys Glu Trp Ala Thr Gin Leu Glu Asn 
65 70 75 80 

Glu Met Thr Arg Glu Asp Gin Pro Gly Asp Leu Thr Thr Vai Asn Val 

85 90 95 

Ala Met Gin Lys Gin His Leu lie Glu Thr Glu Met lie Lys Lys Ala 
100 105 110 

Gin His lie Asp Gin Leu Met Glu Met Glu Pro Gin Leu Glu Glu Leu 
115 120 125 

His Pro Asp Glu Leu Glu Asn lie Lys Ala His Arg Leu Ala Val Gin 
130 135 140 

Glu Gin Leu Gin Arg Leu Gin Ala Pro Leu Asp Asp Arg Arg Lys Ala 
145 150 155 160 

Leu Glu Arg Lys Lys Ala Ala Phe Gin Phe Giy Arg Asp Val Asp Asp 
165 170 175 

Glu Lys Leu Trp lie Ser Glu Arg Leu Val Leu Ala Lys Ala Gin Asn 
180 185 190 

Leu Gly Glu Ser Leu Pro Asp Cys His Arg Leu Gin Lys Asn Leu Gin 
195 200 205 

Leu Leu Ser Asn Glu lie Asd Asn His Glu Pro Trp lie Asn Gin lie 
210 215 220 
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Cvs Psr. Asn Gly Gin Glu Leu He Asp Glu Gly His Ala Asn Gly Pro 
225 230 235 240 

Ha Phe Glu Lvs Lys He Gin Glu Leu Arg Ser Ala Trp Gin Glu Leu 
' 245 250 255 

lvs G'-.^ 2ila Val Lys Asp Arg Lvs Gly Asp Leu Gly Glu Ser Glu Lys 
260 265 270 

--.a His Gin Phe Leu Tyr Asp Cys Gly Glu Ala Glu Ala Trp Met Ser 
275 280 285 

Giu Gin Glu Leu Tyr Met Met Gin Asp Glu Arg Gly Lys Asp Glu Phe 
290 295 300 

^-r TH-- Lvs Asn Gin He Lys Lys His Glu Arg Leu Gin Ser Asp He 
305 ' 310 315 320 

-so L^- Ph<=- A^a ASP Thr He Arg Ala Leu Ala Thr Lys Ala His Lys 

325 330 335 

nn- Val Glu Glu Lys Ser Pro Leu Thr Glu Gin He Gin Val Arg Gin 
340 345 350 

^la Gi^ He G^ u Lys Leu Tyr Ala Gly Leu Gin Asp Leu Ser Lys Glu 
355 360 365 

Arg Arg Lys Arg Leu Glu Glu Thr Leu Giu Leu Tyr Ala Leu His Arg 
370 375 380 

Glu He AsD Asp Leu Leu Gin Trp He Ala Asp Lys Glu Val Val Ala 
385 ' 390 395 400 

G'v Ser Gin Glu Asn Gly Gin Asp Tyr Glu His Val Gin Met Leu Gin 

405 410 415 

Glu Arg Phe Gin Gin Phe Ala Arg Asp Thr Glu Asn He Gly Ser Glu 
420 425 430 

Arg Val Ala Asn Ala Asn Asp Gly Cys Asp Thr Leu He Gly His Gly 
435 440 445 

His Th- Asp Ala Pro Thr He Ala Leu Trp Lys Asp Ser Leu Asn Glu 
450 455 460 

Ala Trp Glu Asn Leu Leu Glu Leu Met Asp Thr Arg Ala Gin He Leu 
465 470 475 480 

Glu Ala Ser Arg Leu Leu His Lys Phe Tyr His Asp Cys Arg Asp Cys 

485 490 495 

Leu ser Arg He Met Glu Lys Thr His Ala Met Pro Asp Asp Leu Gly 
500 505 510 

Arg Asp Ser Ser Ser Val Gly Ala Leu Ser Arg Lys His Gin Asn Tyr 
515 520 525 

Leu Lys Asp He Ala Ala He Gly Glu Gin Val Ala Gin He Glu Arg 
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530 535 540 

Asp Aia Ala Glu Leu Arg Asp Gly Tyr Ala Gly Asp Lys Ala Leu Asp 
545 550 555 560 

He Glv Ser Arg Glu Ser Glu Val Val Lys Ala Trp Arg His Leu Arg 
565 570 575 

Gly Leu Cys Asd Ala Arg Thr Ser Arg Leu Met Asp Thr Ser Asp Leu 
580 585 590 

Phe Lvs Phe Met Asn Met Val Arg Asp Leu Leu Leu Trp Met Asp Glu 
595 600 605 

Val Lvs Arg Glu Met Asn Ser Gin Glu Arg Pro Lys Asp Val Ser Gly 
610 615 620 

Val Glu Leu Leu Met Asn Asn His Gin Ser Leu Lys Ala Glu He Asp 
625 630 635 640 

Ala Arg Glu Glu Asn Phe Asn Ala Cys lie Ser Leu Gly Arg Asp Leu 

645 650 655 

Leu Asn Arg Lys His Tyr Ala Ser Ser Glu He Glu Lys Lys Leu He 
660 665 670 

Lys Leu Thr Thr Glu Arg Ala Glu Met Met Arg Arg Trp Glu Asp Arg 
675 680 685 

Trp Glu Tyr Leu Gin Leu He Leu Glu Val Tyr Gin Phe Ala Arg Asp 
690 695 700 

Ala Ala Val Ala Glu Ser Trp Leu Phe Ala Gin Glu Pro Tyr Leu He 
705 710 715 720 

Ser Lys Glu Tyr Gly Arg Asn Leu Glu Glu Thr 
725 730 



<210> 15 
<211> 3561 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 15 

atgagctcta gtagtagtga tgaaaatgaa acaacgatac acagaacagg atcaaatacc 60 

ggtggaagtg gaatctactc acaacctcga gcaggatcat caaaaagaac atcgaatgtg 120 

cgacaugatg ttagtgatgt agatgatgaa gaagagcatt atgcaagatt tcgcgaagat 180 

acggcgatcg aggttgacga tgctataaca gttcttcttt catctctaca tttcgaacac 240 

aaacgtgata ttgttccgac cgatgaagat gataataaac ttcgagaact tcacgaaaag 300 

atttttgcat tgataacgag tgaatctgat gttaacagaa aaagacggct gaaaaaagct 360 

cttccrgcgt caaactgtgt tagagaacaa gtatattatc ttcgaaggaa accarcaaca 420 

ccaccagctt cttattatca tcgactaaat gcagctcttc acacaatcgt taaggaatca 480 

tttggagaag aatatcgaaa agttgctaca gtattaggac ttgttgaagc gcttgcagaa 540 

gttttaattc ttgaagttca cacatttgga atcaatgaga ccaatcctgg agagcatcga 600 

aatatrcgaa aacttattgc caatgctctt acgaatttaa catacggtca aattcactct 660 

aaacgccggc tatgctcata tgatggattc attcgatgcg ttgttcgaat tgtgatcgaa 720 

tctccaaata taactcaagt atacgctgga ctaatcagaa atctctcatg gaacgccgat 780 
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rcaggaatga gtgaagctct tcaaccaaca gttcacgcac tttcaattgc tgcagtacat 840 
qcacatacac atagattcga tgttacagca acattatctg ctttgtggaa tcttgctggt 900 
cactcagtag agaataaacg aactatttgt gatactccga attgtttaaa agtatragcc aou 
agccttctat ctcctgatgc aagattcacg tcacttgttg attcagctac tggaattctg 1020 
aagtatgtat cacaatatct cgcgaatact tccacgcatt tagagctaag aagcttgctc 1080 
atcactcaaa tactcacttt gcncaagagt gcctcgttca cttgtgttac aaatactcta 1140 
ggagctattg caaacctaat cgtcaaagat ccccacatgc agcaaatgat tcggcagyat 1200 
atggctgcag tacagcaact caacgtactc cgaaactcga atcgrgacga tattcggaca 1260 
gcagtgaagt ctgtgctgaa tactcttaat caaccgtgta gtcatcgrta tggtgatatg 1320 
tcacattctg ctggaggagg agccacaggg atgcaaatgc tttcggagcc acaacttcag 1380 
atgcaaacat ctcatcatgc ttatcatgga actgcgtcac caagattact atcactgaga 1440 
gcaactcgag catcacctgg aaaatatata caaccacagg ctcaacaaca gttgatacag 1500 
actccacaag ttgatcaaag atctagtagt ctgccaagac attttgccgt tcaacgaaat 1560 
ggatttgtga tggcacagtc ctacaatcaa caaatggatc aacatcagca acaacaaatg 1520 
atctatcaat tgcagcaaca acaacaaatt atgacggagg atcaagcgca aatggaacat 1680 
catcaacaaa tcatgtacct tcagcagcaa caacaacaat ttcatcaaat tcaacaacag 1740 
caacaaatgc aaaaagctca ggaagctgat ccagttcctc ctacggatga cgatctagat 1800 
attcccacat ctaccgtaat gggaacacga agtaacagtg aaagaagtct cggctcgatg 18 60 
aatcctggaa gtgttatgac aaactggaat tcctctctgg atactgcagc caacagttct 1920 
cgtgctcttt ctcctgtatc atacaatgat attccagctt ctccaacaat gtgtgctcag 1980 
atgttcaatt tacctaaatc cactgaatcg gaacatcatc aattgacgtc tcaacagcag 2040 
aacactacac attatagcag tggatcagct aacactatga ctcggagtga tggtgcaact 2100 
actgtgccaa tggataacat tattacacca acttatgcta ttttgaatcc aatattggta 2160 
cacgaacaga caccgaacgg gacagttcca cgaaagactt cagaagagtt ggatagtcct 2220 
gatgatgttc tcccaggtcc atctcttgag gaggaagaag gtgattatgc tataatagga 2280 
ggtgccgcac aaaagacaga tgacgagctt ttgactcgtt ctattcaatc tgaaatgcct 2340 
acttcgtcgt ctactccgaa gatgaaagtc tcgcctcgcc ttaatggatt cttctctcca 2400 
actcagaaaa caacatcgtc tcctgcatgg tctcatccag atacctcacc gattccaaag 2460 
tcatcttctc atcgaactca acctaatcgt cgtcaagatg cttctgatgc agatcgtctt 2520 
ttaatggaat caataatgtc agagatgcca aaaagtcgaa ttatatctcc acgacttgct 2580 
ggtacacaac aatatcttga acccgagcca gagagaagaa gtcattcaaa aaacgaggaa 2640 
gccgatcgcc gcgacgcatt caccgcatcg catgagccca gcgatcataa cggcattgat 2700 
gtggcaagag gctcagactg gtctccacaa caacaattac atcgaatgga gtcattggaa 2760 
agccaagcta gtagtgaaga tagttttgga ttaactgctg aagaaccaaa ttcatcaact 2820 
rctggtgctg ctgcaaacac targagattt gacgatgaga tcgatgcatc tcttccaatg 2880 
gattgtgtcg atgatgatga ttacgactat acttatgatc atttcgaaga ttatgaagat 2940 
gaggaggatc ctgatgcgac tcagtttgat gatggagttg atgcacaact aactatcgat 3000 
tgttcaatga tttctagtgg aagcggttca tcgcaaagaa atgaaactac gacaacgtca 3060 
cgagattcaa aagctttagc tacttccaca ccaaaaggat ctgcatcttc actcccagga 3120 
gttcgacaag ccacccgagt atcaacgaat ggaaagtcaa gactgcctgt tccaaaaacg 3180 
aatggatcac ttgttgataa gaatcccaag ccaatcatcg cttcccggcg gccacgattg 3240 
ccaccgaagc cgacactttt aaaagataag cactatccag aagaggattc gattgagaat 3300 
cagacgagag atgatacgat atatgtgaat gctccagttg ttgaggcaga acaagaacga 3360 
atttatatga atgcattgaa gcaacagaag aacattgagc aatcaccgtc aattggaaac 3420 
ggatcaccta tagctaaatc tgcaattgta acgccctaca attatcagaa accgcctttc 3480 
actggtcgaa ataatggtga aatgagcaac gagaagagtg tgacaccgaa tcctaaacaa 3540 
atgcttgtaa caatagtcta a 

<210> 16 
<211> 1186 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 16 

Met Ser Ser Ser Ser Ser Asp Glu Asn Glu Thr Thr He Hxs Arg Thr 
15 10 15 

Gly Ser Asn Thr Gly Gly Ser Gly He Tyr Ser Gin Pro Arg Ala Gly 
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20 25 30 

Ser Ser Lys Arg Thr Ser Asn Val Arg His Asp Val Ser Asp Val Asp 
35 40 45 

Asp Glu Giu Glu His Tyr Ala Arg Phe Arg Glu Asp Thr Ala lie Glu 
50 55 60 

Val Asp Asp Ala lie Thr Val Leu Leu Ser Ser Leu His Phe Glu His 
65 70 75 80 

Lys Arg Asp lie Val Pro Thr Asp Glu Asp Asp Asn Lys Leu Arg Glu 

85 90 95 

Leu His Glu Lys lie Phe Ala Leu lie Thr Ser Glu Ser Asp Val Asn 
100 105 110 

Arg Lys Arg Arg Leu Lys Lys Ala Leu Pro Ala Ser Asn Cys Val Arg 
115 " ^ 120 125 

Glu Gin Val Tvr Tyr Leu Arg Arg Lys Pre Ser Thr Pro Pro Ala Ser 
130 ^ 135 140 

Tyr Tyr His Arg Leu Asn Ala Ala Leu His Thr lie Val Lys Glu Ser 
145 150 155 160 

Phe Gly Glu Glu Tyr Arg Lys Val Ala Thr Val Leu Gly Leu Val Glu 
165 170 175 

Ala Leu Ala Glu Val Leu He Leu Glu Val His Thr Phe Gly He Asn 
180 185 190 

Glu Thr Asn Pro Gly Glu His Arg Asn He Arg Lys Leu He Ala Asn 
195 200 205 

Ala Leu Thr Asn Leu Thr Tyr Gly Gin He His Ser Lys Arg Arg Leu 
210 215 220 

Cys Ser Tyr Asp Glv Phe He Arg Cys Val Val Arg He Val He Glu 
225 * 230 235 240 

Ser Pro Asn He Thr Gin Val Tyr Ala Gly Leu He Arg Asn Leu Ser 
245 250 255 

Trp Asn Ala Asp Ser Gly Met Ser Glu Ala Leu Gin Pro Thr Val His 
260 265 270 

Ala Leu Ser He Ala Ala Val His Ala His Thr His Arg Phe Asp Val 
275 280 285 

Thr Ala Thr Leu Ser Ala Leu Trp Asn Leu Ala Gly His Ser Val Glu 
290 295 300 

Asn Lys Arg Thr He Cys Asp Thr Pro Asn Cvs Leu Lys Val Leu Ala 
305 310 315 320 

Ser Leu Leu Ser Pro Asp Ala Arg Phe Thr Ser Leu Val Asp Ser Ala 
325 330 335 
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Thr Gl-.' Hp T.ftu Lvs Tvr Val Ser Gin Tyr Leu Ala Asn Thr Ser Thr 
340 ' ' 345 350 

His Leu Glu Leu Arg Ser Leu Leu He Thr Arg Met Leu Thr Leu Leu 
355 360 365 

Lvs Ser Ala Ser Phe Thr Cys Val Thr Asn Thr Leu Gly Ala He Ala 
' 37C 375 380 

Ksn Leu He Val Lys Asp Pro His Met Gin Gin Met He Arg Gin Asp 
385 ' 390 395 400 

Met Al' Ala Val Gin Gin Leu Asn Val Leu Arg Asn Ser Asn Arg Asp 

405 410 415 

z^SD II- Arq Tbr Ala Val Lys Ser Val Leu Asn Thr Leu Asn Gin Pro 
420 425 430 

Cys S-r His Arg Tyr Gly Asp Met Ser His Ser Val Gly Gly Gly Ala 
435 440 445 

Thr Glv Met Gin Met Leu Ser Glu Pro Gin Leu Gin Met Gin Thr Ser 
450 455 460 

His His Ala Tyr His Gly Thr Ala Ser Pro Arg Leu Leu Ser Leu Arg 
465 470 475 480 

Ala Thr Arg Ala Ser Pro Gly Lys Tyr He Gin Pro Gin Ala Gin Gin 

485 490 495 

Gin Leu He Gin Thr Pro Gin Val Asp Gin Arg Ser Ser Ser Leu Pro 
500 505 510 

Arg His Phe Ala Val Gin Arg Asn Gly Phe Val Met Ala Gin Ser Tyr 
515 520 525 

Asn Gin Gin Met Asp Gin His Gin Gin Gin Gin Met He Tyr Gin Leu 
530 535 540 

Gin Gin Gin Gin Gin He Met Thr Glu Asp Gin Ala Gin Met Glu His 
545 550 555 560 

His Gin Gin He Met Tyr Leu Gin Gin Gin Gin Gin Gin Phe His Gin 

565 570 575 

He Gin Gin Gin Gin Gin Met Gin Lys Ala Gin Glu Ala Asp Pro Val 
580 585 590 

Pro Pro Thr Asp Asp Asp Leu Asp He Pro Thr Ser Thr Val Met Gly 
595 600 605 

Thr Arg Ser Asn Ser Glu Arg Ser Leu Gly Ser Met Asn Pro Gly Ser 
610 615 620 

Val Met Thr Asn Trp Asn Ser Ser Leu Asp Thr Ala Ala Asn Ser Ser 
625 630 635 640 
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Arg Ala Leu Ser Pro Vai Ser Tyr Asn Asp He Pro Ala Ser Pre Thr 

645 650 655 

Met Cys Ala Gin Val Phe Asn Leu Pro Lys Ser Thr Glu Ser Glu His 
660 665 670 

His Gin Leu Tiir Ser Gin Gin Gin Asn Thr Thr His Tyr Ser Ser Gly 
675 680 685 

Ser Ala Asn Thr MeL Thr Arg Ser Asp Gly Ala Thr Thr Val Pre Met 
690 695 700 

^sp Asn He He Thr Pro Thr Tyr Ala He Leu Asn Pro He Leu Val 
705 710 715 720 

His Glu Gin Thr Pro Asn Gly Thr Val Pro Arg Lys Thr Ser Glu Glu 

^^25 730 735 

Leu ASD Ser Pre- Asc Asp Val Leu Pro Gly Pro Ser Leu Glu Glu Glu 
74-: 745 750 

Glu Gly AsD Tvr AH i:.- He Gly Gly Ala Ala Gin Lys Thr Asp Asp 
755 ' 760 765 

Glu Leu Leu Thr Arg Ser He Gin Ser Glu Met Pro Thr Ser Ser Ser 
770 775 780 

Thr Pro Lys Met Lvs Val Ser Pro Arg Leu Asn Gly Phe Phe Ser Pro 
785 ^ 790 795 800 

Thr Gin Lys Thr Thr Ser Ser Pro Ala Trp Ser His Pro Asp Thr Ser 

805 810 815 

Pro He Pro Lvs Ser Ser Ser His Arg Thr Gin Pro Asn Arg Arg Gin 
820 825 830 

Asp Ala Ser Asp Ala Asd Arg Leu Leu Met Glu Ser He Met Ser Glu 
835 ' 840 845 

Met Pro Lys Ser Arg He He Ser Pro Arg Leu Ala Gly Thr Gin Gin 
850 855 860 

"yr Leu Glu Pro Glu Pro Glu Arg Arg Ser His Ser Lys Asn Glu Glu 
865 870 875 880 

Ala Asp Arg Arg Asp Ala Phe Thr Ala Ser His Glu Pro Ser Asp His 

885 890 895 

Asn Gly He Asp Val Ala Arg Gly Ser Asp Trp Ser Pro Gin Gin Gin 
900 905 910 

Leu His Arg Met Glu Ser Leu Glu Ser Gin Ala Ser Ser Glu Asp Ser 
915 920 925 

Phe Gly Leu Thr Ala Glu Glu Pro Asn Ser Ser Thr Ser Gly Ala Ala 
930 935 940 

Ala Asn Thr Met Arg Phe Asp Asp Glu He Asp Ala Ser Leu Pro Met 
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945 950 955 960 

Asp CVS Val Asp Asp Asp Asp Tyr Asp Tyr Thr Tyr Asp His Phe Glu 

965 970 975 

Asp TV- Glu Asp Glu Glu Asp Pro Asp Ala Thr Gin Phe Asp Asp Gly 
980 985 990 

Val ASD Ala Gin Leu Thr He Asp Cys Ser Met He Ser Ser Gly Ser 
995 1000 1005 

Gly Ser Ser Gin Arg Asn Glu Thr Thr Thr Thr Ser Arg Asp Ser Lys 
1010 1015 1020 

Ala Leu Ala Thr Ser Thr Pro Lys Gly Ser Ala Ser Ser Leu Pro Gly 
1025 1030 1035 1040 

Val A-q Gin Ala Thr Arg Val Ser Thr Asn Gly Lys Ser Arg Leu Pro 
1045 1050 1055 

Val pr-o Lys Thr Asn Gly Ser Leu Val Asp Lys Asn Pro Lys Pro He 
1060 1065 1070 

He Ala Ser Arg Arg Pro Arg Leu Pro Pro Lys Pro Thr Leu Leu Lys 
1075 1080 1085 

Asp Lys His Tyr Pro Glu Glu Asp Ser He Glu Asn Gin Thr Arg Asp 
1090 1095 1100 

Asp Thr He Tyr Val Asn Ala Pro Val Val Glu Ala Glu Gin Glu Arg 
1105 1110 1115 1120 

He Tyr Met Asn Ala Leu Lys Gin Gin Lys Asn He Glu Gin Ser Pro 
1125 1130 1135 

Ser He Gly Asn Gly Ser Pro lie Ala Lys Ser Ala He Val Thr Pro 
1140 1145 1150 

Tyr Asn Tyr Gin Lvs Pro Pro Phe Thr Gly Arg Asn Asn Gly Glu Met 
1155 ' 1160 1165 

Ser Asn Glu Lys Ser Val Thr Pro Asn Pro Lys Gin Met Leu Val Thr 
1170 1175 1180 



He Val 
1185 



<210> 17 
<211> 1742 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 17 

atgagctcta gtagtagtga tgaaaatgaa 
ggtggaagtg gaatctactc acaacctcga 
cgacatgatg ttagtgatgt agatgatgaa 
acggcgatcg aggttgacga tgctataaca 



acaacgatac acagaacagg atcaaatacc 60 
gcaggatcat caaaaagaac atcgaatgtg 120 
gaagagcatt atgcaagatt tcgcgaagat 180 
gttcttcttt catctctaca tttcgaacac 240 
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aaacgtgata ttgttccgac cgargaagar gataataaac rtcgagaacc rcacgaaaag 300 

attttrgcat tgataacgag tgaatctgat gttaacagaa aaagacggcc gaaaaaagct 360 

cttcctgcgt caaactgtgt tagagaacaa gtatatcanc ttcgaaggaa accatcaaca 420 

ccaccagctt cttattatca tcgacraaat gcagctcttc acacaatcgt taaggaatca 480 

tttggagaag aatarcgaaa agttgctaca gtattaggac ttgttgaagc gcttgcagaa 540 

gttttaattc ttgaagttca cacatttgga atcaatgaga ccaatcctgg agagcatcga 600 

aauatrcgaa aacccartgc caatgctctt acgaatttaa cauacgguca aattcactct 660 

aaacgccggc tatgcLcata tgarggattc atrcgacgcg ttgttcgaat tgtgatcgaa 720 

tctccaaata raacucaagt acacgctgga ctaatcagaa arctctcatg gaacgccgar 780 

tcaggaatga gtgaagctct tcaaccaaca gttcatgcac tttcaattgc tgcagtacat 840 

gcacatacac atagattcga tgttacagca acattatctg ctttgT:ggaa tcrtgctggt 900 

cactcagtag agaataaacg aactatttgc gatactccga attgtttaaa agtattagcc 960 

agccttctat ctcctgatgc aagattcacg tcacttgttg attcagctac tggaattctg 1020 

aagtatgtat cacaatatct cgcgaatact tccacgcatt tagagctaag aagcttgctc 1080 

atcactcgaa tgctcacttt gctcaagagt gcctcgttca cttgtgttac aaatactcta 1140 

ggagctattg caaacctaat tgtcaaagat ccccacangc agcaaatgat tcggcaggat 1200 

atggctgcag tacagcaact caacgtactc cgaaactcga atcgtgatga tattcggaca 1260 

gcagtgaagt ctgtgctgaa tactcttaat caaccgtgta gtcatcgtta tggtgatatg 1320 

tcacattctg tiiggaggagg agccacaggg atgcaaatgc tttcggagcc acaacttcag 1380 

atgcaaacat ctcarcatgc tratcatgga actgcgtcac caagattact arcactgaga 1440 

gcaactcgag catcacctgg aaaatatata caaccacagg ctcaacaaca gttgatacag 1500 

actccacaag ttgaccaaag atctagtagt ctgccaagac attttgccgt tcaacgaaat 1560 

ggatttgtga tggcacagtc ctacaarcaa caaatggatc aacatcagca acaacaaatg 1620 

atctatcaat tgcagcaaca acaacaaatt atgacggagg atcaagcgca aatggaacat 1680 

catcaacaaa tcatgtaccr ncagcagcaa caacaacaat ttcatcaaat tcaacaacag 1740 
ca 1^42 

<210> 18 
<211> 509 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 18 

Met Ser Ser Ser Ser Ser Asp Glu Asn Glu Thr Thr He His Arg Thr 
15 10 15 

Gly Ser Asn Thr Gly Gly Ser Gly He Tyr Ser Gin Pro Arg Ala Giy 
20 25 30 

Ser Ser Lys Arg Thr Ser Asn Val Arg His Asp Val Ser Asp Val Asp 
35 40 45 

Asp Glu Glu Glu His Tvr Ala Arg Phe Arg Glu Asp Thr Ala He Glu 
50 ^ 55 60 

Val Asp Asp Ala He Thr Val Leu Leu Ser Ser Leu His Phe Glu His 
65 70 75 80 

Lys Arg Asp He Val Pro Thr Asp Glu Asp Asp Asn Lys Leu Arg Glu 

85 90 95 

Leu His Glu Lys He Phe Ala Leu He Thr Ser Glu Ser Asp Val Asn 
100 105 110 

Arg Lys Arg Arg Leu Lys Lys Ala Leu Pro Ala Ser Asn Cys Val Arg 
115 120 125 



Glu Gin Val Tyr Tyr Leu Arg Arg Lys Pro Ser Thr Pro Pro Ala Ser 
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13: 135 140 

Tvr TV H^s Arq Leu Asn Ala Ala Leu His Thr lie Val Lys Glu Ser 
145 " 150 155 160 

Dhe G'-- Glu Glu Tyr Arg Lys Val Ala Thr Val Leu Gly Leu Val Glu 

165 1^0 175 

%ia L- -la Giu Val Leu He Leu Glu Val His Thr Phe Gly He Asn 
180 185 190 

Glu Th' Asn Pro Gly Glu His Arg Asn He Arg Lys Leu He Ala Asn 
195 200 205 

Ala Leu Thr Asn Leu Thr Tyr Gly Gin He His Ser Lys Arg Arg Leu 
210 215 220 

Cvs Tv- -SD Glv Phe He Arg Cvs Val Val Arg He Val He Glu 

225 ----- - 23C " 235 240 

Ser P-r, Th- G.r. Val Tvr Ala Gly Leu He Arg Asn Leu Ser 

245 250 255 

Trp A-i Ala Asd Ser Giv Met Ser Glu Ala Leu Gin Pro Thr Val His 
260 ' 265 270 

Ala Leu Ser He Ala Ala Val His Ala His Thr His Arg Phe Asp Val 
275 280 285 

Thr Ala Thr Leu Ser Ala Leu Trp Asn Leu Ala Gly His Ser Val Glu 
290 295 300 

Asn Lvs Arg Thr He Cys Asp Thr Pro Asn Cys Leu Lys Val Leu Ala 
305 ' 310 315 320 

ser Leu Leu Ser Pro Asp Ala Arg Phe Thr Ser Leu Val Asp Ser Ala 

325 330 335 

Thr G- v He Leu Lvs Tvr Val Ser Gin Tyr Leu Ala Asn Thr Ser Thr 
340 ^ 345 350 

His Leu Glu Leu Arg Ser Leu Leu He Thr Arg Met Leu Thr Leu Leu 
355 360 36:3 

Lys Se- Ala Ser Phe Thr Cys Val Thr Asn Thr Leu Gly Ala He Ala 
370 375 380 

Asn Leu He Val Lys Asp Pro His Met Gin Gin Met He Arg Gin Asp 
385 390 395 400 

Met A: a Ala Val Gin Gin Leu Asn Val Leu Arg Asn Ser Asn Arg Asp 

405 410 415 

Asp T^e Arg Thr Ala Val Lys Ser Val Leu Asn Thr Leu Asn Gin Pro 
420 425 430 

Cys His Arg Tyr Gly Asp Met Ser His Ser Val Gly Gly Gly Ala 

435 440 445 
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Thr Glv Met Gin Met Leu Ser Glu Pro Gin Leu Gin Met Gin Thr Ser 

450 455 460 

His His Ala Tyr His Gly Thr Ala Ser Pro Arg Leu Leu Ser Leu Arg 

4£5 470 475 4S0 

Arg Ala Ser Pro Gly Lys Tyr lie Gin Pro Gin Ala Gin Gin 

485 " ^ " 490 495 

:^ . : l -e.; Tie Gin Thr Pro Gin Val Asp Gin Arg Ser Ser 

500 505 



r." rhabditis elegans 



tgtgcgaact gcacatgatg cctgaagagg atcaatggcc cgatgacatt 60 
tcgctgctct caacttttgt ggagaagatt cgatgctctt tgaarcagtt 120 
-itrcitric tatcaactga ttctttagac tctcaacagt ttcgagagag atgtcaaatg 180 
d£.c.c:au-aa- artttcaatt agcatrcgca gactctggac attggcaatc aggtataaat 240 
z-x.aa'izz-a caacttgggg tcggattcgg acatctgaac cattggatga acgtactgcc 300 
a-zzizzczaq atgtttggaa tgraaaaaga agtgatagtg ctcgatcacc aaatcgtcca 360 
aatzcazzga ttgccaactt tgttcccgga gatgctactc gattcgtcga tgtaaatgat 420 
aatqaaantc gcgaggcaaa tgaagaaata attcgtaaag atcgttggag aagagattca 480 
cicgagacaat gctcatctgg aggacaaaat cagaaacgaa catttgcgga tattcrggaa 540 
aaaaatqraa cagcaccaac ttcaatggca atcacatcgt ctgataatga aaagcctccg 600 
aaatraqaci tcctggcaat gcatcatgag atgccatctc tctgcgagtc attcaccgct 660 
ccattccgta atgctattat raaaatgcag aaatgtgagc cactaccgtc gattacttca 720 
accaatqatt ttccgctctt ctttcaagaa gacrctcccg actcaggact cggtrgtagt 780 
qgtccgagcc acattgaaga ttggcaatct ctatctgttc ttctgcccaa acacctcgca 840 
gaagcatgct cattcttcaa gagcaacacc caattattaa catcaagtac atcaaaaaca 900 
czacczcaaa catcaacaaa catcgratca aattgcattg atcgccgtat ctctggtatt 960 
tcagcaccaq caaatgaagc atgtcgaacg tgttatcgag tacgaaggag gatacaccca 1020 
ccagttrggc ctcaaacrgc ccaatcaaag acggttcttt gtgactgcgc atcaacaccc 1080 
acggai-acga atttctcatt tgcaccaaca acttcaacga ctcgccatca actgcgcgca 1140 
aaagagctct caatrgtcgg attgccgatt tatgcagcaa agcggacact cgttgagaat 1200 
gcrgtcgaag gcgttgctgc aatttcaaga ggggatggaa gtgatttgct tgtcatcgcc 1260 
a-gagatgct naatngaaga tggtcttcaa gaaaatgrat cggcatggac aatgattcag 1320 
acggtuacct caaaaggccc agctacaaaa gatgttcatt caattgtaaa acaartggaa 1380 
gaatgrtcaa agactgacaa cgtcaaagtt gaaatcttct ttgaggagct tatcagagaa 1440 
aactcarrgg acrgcrggct gtgctatatt gtgttgaaag agaaggtgct gaaaacattg 1500 
-attccgaaa atgcattcct tctttctgcg agttctgaat acagaactct tctttggaga 1560 
atggiicgaca gtttgtctct tttgccagtg atcgaagctc gcagtgattc agttcaccaa 1620 
cagttcaaat caatgcaaca atggggcgga gcatccagaa tagcttctga ctcaagagtt 1680 
ocaaagagtt catcgttccc agcaagatta tcaactgctc catcaagaag atcaaggata 1740 
ccattgtcaa caagtcgcat cagtatcagc tcaacaactt caacacctcg atcagctcga 1800 
tcaccttcaa cgacatctcg tattcgagtt gcatctataa tgggtgactt cacacttgcc 1860 
aattrctccc tctcagacgg agaaaaagtt agtgttctat caactcgtgg aggacttgct 1920 
cgatgtgtcc gtttaacgac trctcattca aaaatcaaca atggagttat tccga-ctgaa 1980 
catcttcttt tccagtga 1998 

<210> 20 
<211> 665 
<212> PRT 
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<213> Caenorhabditis elegans 
<400> 20 

Met Val Glu Leu Cys Glu Leu His Met Met Pro Glu Giu Asp Gin Trp 
I 5 10 15 

Pro Asp Asp Tie Asp Ala Arg He Ala Ala Leu Asn Phe Cys Gly Glu 
20 25 30 

Asp Ser Met Leu Phe Glu Ser Val Asp Pro Ser Val Ser Thr Asp Ser 
35 40 45 

Leu Asp Ser Gin Gin Phe Arg Glu Arg Cys Gin Met Lys Lys Giu Asp 
50 55 60 

Phe Gin Leu Ala Phe Ala Asd Ser Gly His Trp Gin Ser Gly He Asn 
65 10 ' 75 80 

Aso Asn Leu '^hr Thr Tro Gly Arg He Arg Thr Ser Glu Pro Leu Asp 

85 " 90 95 

Giu Arg Thr Ala Ser Ala Pro Asp Val Trp Asn Val Lys Arg Ser Asp 

100 105 110 

Ser Ala Arg Ser Pro Asn Arg Pro Asn Ser Leu He Ala Asn Phe Val 
115 120 125 

Ser Gly Asp Ala Thr Arg Phe Val Asp Vai Asn Asp Asn Glu He Arg 
130 135 140 

Glu Ala Asn Glu Giu He He Arg Lys Asp Arg Trp Arg Arg Asp Ser 
145 150 155 160 

Ala Arg Arc Cys Ser Ser Gly Gly Gin Asn Gin Lys Arg Thr Phe Ala 

165 170 175 

Asp He Leu Giu Lys Asn Vai Thr Ala Pro Thr Ser Met Ala He Thr 
180 185 190 

Ser Ser Asp Asn Glu Lys Pro Pro Lys Leu Asp Phe Leu Ala Met His 
195 200 205 

His Giu Met Pro Ser Leu Cys Giu Ser Phe Thr Ala Ser Phe Arg Asp 
210 215 220 

Ala He He Lys Met Gin Lys Cys Giu Pro Leu Pro Ser He Thr Ser 
225 230 235 240 

Thr Asn Asp Phe Pro Leu Phe Phe Gin Glu Asp Ser Pro Asp Ser Gly 

245 250 255 

Leu Gly Cys Ser Glv Pro Ser His He Giu Asp Trp Gin Ser Leu Ser 
260 ' 265 270 

Vai Leu Leu Pro Lys His Vai Ala Glu Ala Cys Ser Phe Phe Lys Ser 
275 280 285 

Asn Thr Gin Leu Leu Thr Ser Ser Thr Ser Lys Thr Ala Pro Gin Thr 
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290 295 300 

Ser Thr Asn lie Val Ser Asn Cys He Asp Arg Arg He Ser Gly He 
305 310 315 320 

5^- Ala Se- Ala Asn Glu Ala Cys Arg Thr Cys Tyr Arg Val Arg Arg 
325 330 335 

^rg He His Pro Pro Val Trp Ala Gin Thr Ala Gin Ser Lys Thr Val 
340 345 350 

Leu Cvs Asp Cys Ala Ser Thr Pro Thr Asp Thr Asn Phe Ser Phe Ala 
355 360 365 

-r^- T^r ^ Ser Thr Thr Arg His Gin Leu Arg Ala Lys Glu Leu Ser 
3-0 375 380 

He Val Glv Leu Pro He Tyr Ala Ala Lys Arg Thr Leu Val Glu Asn 
385 ^ 390 395 400 

Val Val Giu Gly Val Ala Ala He Ser Arg Gly Asp Gly Ser Asp Leu 
405 410 415 

Leu Val He Ala Met Arg Cys Leu He Glu Asp Gly Leu Gin Glu Asn 
420 425 430 

Val Ser Ala Trp Thr Met He Gin Thr Val Thr Ser Lys Gly Pro Ala 
435 440 445 

Thr Lvs ASD Val His Ser He Val Lys Gin Leu Glu Glu Cys Ser Lys 
450 ' 455 460 

Thr Aso Asn Val Lys Val Glu He Phe Phe Glu Glu Leu He Arg Glu 
465 " 470 475 480 

Asn Ser Leu Asp Cvs Trp Leu Cys Tyr He Val Leu Lys Glu Lys Val 
485 490 495 

Leu Lys Thr Leu Tyr Ser Glu Asn Ala Phe Leu Leu Ser Ala Ser Ser 
500 505 510 

Glu Tyr Arg Thr Leu Leu Trp Arg Met Val Asp Ser Leu Ser Leu Leu 
515 520 525 

Pro Val He Glu Ala Arg Ser Asp Ser Val His Gin Gin Phe Lys Ser 
530 535 540 

Met Gin Gin Trp Gly Gly Ala Ser Arg He Ala Ser Asp Ser Arg Val 
545 550 555 560 

Pro Lys Ser Ser Ser Phe Pro Ala Arg Leu Ser Thr Ala Pro Ser Arg 
565 570 575 

Arg Ser Arg He Pro Leu Ser Thr Ser Arg He Ser He Ser Ser Thr 
580 585 590 

Thr Ser Thr Pro Arg Ser Ala Arg Ser Pro Ser Thr Thr Ser Arg He 
595 600 605 
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.rg Val Ala Ser lie Met Gly Asp Phe Thr Leu Ala Asn Phe Ser Leu 
61G 615 ^ 620 

.^r i.^- GW Glu Lvs Val Ser Val Leu Ser Thr Arg Gly Gly Leu Ala 
25 ' ' " 630 635 640 

• — Val Arg Leu Thr Thr Ser His Ser Lys He Asn Asn Gly Val 

645 650 655 

; r : He Glu His Leu Leu Phe Gin 
660 665 



-er '_:rnabditis elegans 



- ' gangtcaaat gaaaaaagaa 

*_^rraii'_ caggracaaa tgataatcng 
r j^r v;3ac:T aacgtactgc cagtgctcca 
z z*^c::;a'^c^^r caaatcgtcc aaattcactg 
::c:^zzzz'^2z argtaaatga taatgaaatt 
cc:*^ cgi^gcja gaagagattc agcgagacga 
acamgcgg; atattctgga aaagaatgta 
tctgaraarg aaaagcctcc gaaattagac 
ccctgcgac^ cattcaccgc ttcgttccgt 
czacraccgt: cgattacttc aaccaatgat 
aacccaggac rcggttgrag tggtccgagc 
cr^cccccca aacacgtcgc agaagcatgt 
acatcaagta catcaaaaac agcaccccaa 
gatcgcraca tctctgguat ttcagcatca 
gracgaagga ggatacaccc accagtttgg 
-gtgac-gcg catcaacacc cacggatacg 
acrcgccarc aactgcgcgc aaaagagctc 
aagcggacac iicgtrgagaa tgttgttgaa 
actgattrgc ::tgtcatcgc catgagatgc 
tcggca::gga caatgattca gacggttacc 
tcaatcgtaa aacaattgga agaatgttca 
rttgaggagc rratcagaga aaactcattg 
gagaaggi:cc tgaaaacatt gtattctgaa 
tacagaaccc rtctttggag aacggrcgac 
cgcagtaant cagttcacca acagttcaaa 
a-cagc-^ctg actcaagagt tccaaagagt 
ccatcaagaa gatcaaggat accattgtca 
tcaaca^c'c gatcagctcg atcaccttca 
arggg-gact tcacacrtgc caatttctcc 
rcaacucgtg gaggacttgc tcgatgtgrc 
aat.ggagtta ctccgattga acatcttctt 

<210> 22 

<211> 612 

<212> PRT 

<213> Caenorhabditis elegans 



gattttcaat 

acaacttggg 

gatgtttgga 

attgccaact 

cgcgaggcaa 

tgctcatctg 

acagcaccaa 

ttcctggcaa 

gatgctatta 

tttccgctct 

cacattgaag 

tcattcttca 

acatcaacaa 

gcaaatgaag 

gctcaaactg 

aatttctcat 

tcaattgtcg 

ggcgttgctg 

ttaattgaag 

tcaaaaggcc 

aagactgaca 

gactgctggc 

aatgcattcc 

agtttgtctc 

tcaatgcaac 

tcaccgttcc 

acaagtcgca 

acgacatctc 

ctctcagacg 

cgtttaacga 

ttccagtga 



tagcattcgc 

gtcggattcg 

atgtaaaaag 

ttgtttccgg 

atgaagaaat 

gaggacaaaa 

cttcaatggc 

tgcatcatga 

ttaaaatgca 

tctttcaaga 

attggcaatc 

agagcaacac 

acatcgtatc 

catgtcgaac 

cccaatcaaa 

ttgcaccaac 

gattgccgat 

caatttcaag 

atggtcttca 

cagctacaaa 

acgtcaaagt 

tgtgctatat 

ttctttctgc 

ttttgccagt 

aatggggcgg 

cagcaagat t 

tcagtatcag 

gtattcgagt 

gagaaaaagt 

cttctcattc 



agactcrgga 60 
gacatctgaa 120 
aagtgatagt 180 
agatgccact 240 
aattcgtaaa 300 
ccagaaacga 360 
aatcacatcg 420 
gatgccatct 480 
gaaatgtgag 540 
agactctccc 600 
tctatctgtt 660 
ccaattatta 720 
aaatrgcatt 780 
gtgttatcga 840 
^gacggrtctt 900 
aacttcaacg 960 
rratgcagca 1020 
aggggatgga 1080 
agaaaatgta 1140 
agatgttcat 1200 
tgaaatcttc 1260 
tgtgtrgaaa 1320 
gagttctgaa 1380 
gatcgaagct 1440 
agcarccaga 1500 
atcaactgcr 1560 
ctcaacaact 1620 
tgcatctata 1680 
tagtgtrcta 1740 
aaaaatcaac 1800 
1839 



<400> 22 
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One Ar-a Giu Arg Cvs Gin Met Lys Lvs Glu Asp Phe Gin Leu Ala Phe 
■ 1 '5 ' 10 15 

;'la Asc Ser Glv His Trp Gin Ser Gly He Asn Asp Asn Leu Thr Thr 
20 25 30 

-r-p G1-- Arg He Arg Thr Ser Glu Pro Leu Asp Glu Arc Thr Ala Ser 
35 40 45 

.-la Pro Asp Val Trp Asn Val Lys Arg Ser Asp Ser Ala Arg Ser Pro 
50 55 60 

Bsn Ar:: Pro Asn Ser Leu He Ala Asn Phe Val Ser Gly Asp Ala Thr 
'65 70 75 80 

Arg P>^- Val Asp Val Asn Asp Asn Glu He Arg Glu Ala Asn Glu Glu 

85 90 95 

Tie Arg Lys Asp Arg Trp Arg Arg Asp Ser Ala Arg Arg Cys Ser 

100 " 105 110 

=:°r Giv Gly Gin Asn Gin Lys Arg Thr Phe Ala Asp He Leu Glu Lys 
115 120 125 

Asn Val Thr Ala Pro Thr Ser Met Ala He Thr Ser Ser Asp Asn Glu 
130 135 140 

Lys Pro Pro Lys Leu Asp Phe Leu Ala Met His His Glu Met Pro Ser 
145 150 155 160 

Leu Cys Glu Ser Phe Thr Ala Ser Phe Arg Asp Ala He He Lys Met 

165 170 175 

Gin Lvs Cys Glu Pro Leu Pro Ser He Thr Ser Thr Asn Asp Phe Pro 
180 185 190 

Leu Pb^- Phe Gin Glu Asp Ser Pro Asp Ser Gly Leu Gly Cys Ser Gly 
195 200 205 

Pro S°r His He Glu Asp Trp Gin Ser Leu Ser Val Leu Leu Pro Lys 
210 215 220 

His Val Ala Glu Ala Cys Ser Phe Phe Lys Ser Asn Thr Gin Leu Leu 
225 230 235 240 

T-h^ S'^r Ser Thr Ser Lys Thr Ala Pro Gin Thr Ser Thr Asn He Val 
245 250 255 

Ser Asn Cys He Asp Arg Arg He Ser Gly He Ser Ala Ser Ala Asn 
260 265 270 

Glu Ala Cys Arg Thr Cys Tyr Arg Val Arg Arg Arg He His Pro Pro 
275 280 285 

Val Trp Ala Gin Thr Ala Gin Ser Lys Thr Val Leu Cys Asp Cys Ala 
290 295 300 

Ser Thr Pro Thr Asp Thr Asn Phe Ser Phe Ala Pro Thr Thr Ser Thr 
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305 310 315 320 

Thr Arg His Gin Leu Arg Ala Lys Glu Leu Ser lie Val Gly Leu Pro 

325 330 335 

He Ty Ala Ala Lvs Arg Thr Leu Val Glu Asn Val Val Glu Gly Val 
340 ' 345 350 

Ala Ala lie Ser Arg Gly Asp Gly Ser Asp Leu Leu Val He Ala Met 
355 360 365 

Arg Cys Leu He Glu Asp Gly Leu Gin Glu Asn Val Ser Ala Trp Thr 

370 375 380 

Met He Gin Thr Val Thr Ser Lys Gly Pro Ala Thr Lys Asp Val His 
385 390 395 400 

Ser He Val Lys Gin Leu Glu Glu Cys Ser Lys Thr Asp Asn Val Lys 

405 410 415 

Val Glu He Phe Phe Glu Glu Leu He Arg Glu Asn Ser Leu Asp Cys 
420 425 430 

Trp Leu Cys Tyr He Val Leu Lys Glu Lys Val Leu Lys Thr Leu Tyr 
435 440 445 

Ser Glu Asn Ala Phe Leu Leu Ser Ala Ser Ser Glu Tyr Arg Thr Leu 
450 455 460 

Leu Trp Arg Met Val Asp Ser Leu Ser Leu Leu Pro Val He Glu Ala 
465 470 475 480 

Arq Ser Asp Ser Val His Gin Gin Phe Lys Ser Met Gin Gin Trp Gly 

485 490 495 

Gly Ala Ser Arg He Ala Ser Asp Ser Arg Val Pro Lys Ser Ser Ser 
500 505 510 

Phe Pro Ala Arg Leu Ser Thr Ala Pro Ser Arg Arg Ser Arg He Pro 
515 520 525 

Leu Ser Thr Ser Arg He Ser He Ser Ser Thr Thr Ser Thr Pro Arg 
530 535 540 

Ser Ala Arg Ser Pro Ser Thr Thr Ser Arg He Arg Val Ala Ser He 
545 550 555 560 

Met Gly Asp Phe Thr Leu Ala Asn Phe Ser Leu Ser Asp Gly Glu Lys 

565 570 575 

Val Ser Val Leu Ser Thr Arg Gly Gly Leu Ala Arg Cys Val Arg Leu 
580 585 590 

Thr Thr Ser His Ser Lvs He Asn Asn Gly Val He Pro He Glu His 
595 ' 600 605 

Leu Leu Phe Gin 
610 
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<210> 23 
<211> 3435 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 23 

atgcaggcga tagatgaacc caagcgaaai: caaaaagrgc cgcccgcaaa acgaaagcgg 60 
atatatttgt ctgacgagga agaggaagar tttgcngaag cggcgcacgt tgagaacaca 120 
gtccccgagc gtgcaactcg acgtagcacc agaagacgtt catcaangca cgaagaactt 180 
ggagnttcag aacaggaaga gtcgccggtt cgcaggacLc gcaaagccgc caagcgcctt 240 
ggaagtgagc aaccagagga aaatctagca gcagargatc cgttgccaat: ggaaggaggt 300 
ggcgagattg tgctaccaat tgctgaaatt gatggaatgg ctgaacaaga aaatgaggac 360 
ttgattgaga aaattggaag ggaagaagag gaggaaggcg ctgaagaaga tgagcaaagc 420 
ggtgaaaaag atccagaaga agaagaggat gatagcagca atgcagaaag ttcagaggaa 480 
tcaactgctc cacgtcaata cagtcttcgc cgccgtcaac cagttgtcca gttcaatgca 540 
tcagaagctc gtgaaaatcg acgggctcgt cttgaacatc arcgggttgc aaatcaaaac 600 
agacatcacc gcaatcgaaa tggatctcgt cgaagacgaa gcgattcgga ttctgatagt 660 
gatgatatgg tgcttccgag acctgataaa cgtcaatcga ggcctcatat gcataaccgt 720 
ggagaacgtg aacgtggtcg atttargccg atcaatatga ccgaaaaaga attgcagtct 780 
gcccaacaca ttctgatgga tcgaargaga aaaactgatg ctggtcaagg tgcaagtgac 840 
attgatccaa tgagtgttga ttcgtcggtt ggtttcgatc aagttggagg actcggtcat 900 
catattcaat ctctcaaaga agttgtcctt ttcccaatgc tttatcctga agtatttgaa 960 
aagttcagaa taaatccgcc aaaaggtgtc gtattctatg ggcccccagg aaccggaaaa 1020 
actcttgttg cacgtgctct tgcgaatgaa tgccgaagag gagccaacaa agttgcattc 1080 
tttatgcgaa aaggagctga ttgtttgagt aaatgggtcg gagagrctga aagacaactt 1140 
cgactattgt ttgatcaagc atatgccatg cgtccarcaa ttatcttttt tgatgaaatc 1200 
gatggtttag ctccagttcg ctcttcgaaa caagatcaaa ttcatgcgag catagtgagt 1260 
acattgttag cactgatgga cggattagat ggtcgaggag aagttgttgt aattggagct 1320 
acaaatcgat tagatactct tgatccagct ctacgaagac caggacgttt tgatagggaa 1380 
ctccgttttt cccttcctga tttgaatgcc cgacgacaaa ttttagatat tcacacttcc 1440 
aaatgggagg aaaacaaacc gattccggaa acactcgatg ctattgcaga gagaactagt 1500 
ggatattgcg gtgccgattt gaaattcttg tgcacagaag cagttcttat tggattgaga 1560 
tctcgatatc ctcatatcta catgtgttct gaaaggccga aactcgacgt tgccaccatt 1620 
aaaatcactt ctaagcattt tggacatgca atgcggagga tcacaccrgc ttccagacga 1680 
gatcttacaa ttccatcgag accattggat gaacgaactt caattttact tggagatact 1740 
gtgagcaact tgatttcgtt gagaattcca caaggaratc gatgcgtgga aaatgcaatg 1800 
gcgacagctt caagcgaact ggaacaagtt gttagggcat tggaacctaa tccaaccgtg 1860 
cctgctattc gtcttcttct atgtggatct gagcaactcg cggacggtgg acaaacgtct 1920 
tacgttcttc ccgccattct agcaaaactg gatcarcnac cggtgttctc tctttctgrt 1980 
tcaagtcttt taacagatgg acgtccagaa gaagcatttt caaatgcaat tcagtcggca 2040 
atgcgtgcat cagcaaccgg cccatgtatc atgcctcttc catcaatcga tgaatggatt 2100 
aaagtcattc ctgtatcagt tcaacatarg cttatcacat gtctggaatc aatgactggt 2160 
ttcacgccga ttcrtttcct atcgacgctt gacactccgt ttgaagatgc gcctgaatat 2220 
gtcacggaaa ttttcagaca tgcaaactgt atcactrtga atccttcccg ccgaacaatt 2280 
cgtcaaaaat atttcgagca cgttattgag aagattaaca ctccaccaaa agtatttgat 2340 
ccacgtctta tgcgagatcg tcgattcgtt gaatttgtcg agcccgtgga tccggatgag 2400 
gctgaggact atiiatgagat tatcgaaaca ccgatttgta tgcaggatat tatggaaaag 2460 
ctgaataatt gcgaatataa tcatgcggac aaatttgttg ctgacttgat attgattcaa 2520 
acgaatgctc tcgaatataa tccgtcgaca acaaaagacg gaaaactgat tcgtcaaatg 2580 
gcaaatacac ttcgagatgc aattgatgat ctgatcgagt gtgaacttga cgaatcgttt 2640 
gtcgaacgaa ttqaaacagt ttcacgaatg ctacaagacg ctggagtaac tccaacctct 2700 
gataaattac tcacagaaat tcccaaaggt tttgcccgaa aaaaggcgtg gtctatgacc 2760 
aattcacttg caaaagaaat tgagcaatgg acatcagagc gtgaagctga aaatcaaaaa 2820 
atgttgagca agcttggtgt agctgctcca acattggaac tcgttgttgt ccctgtagaa 2880 
gatatgaaaa gtgaagaagg aacgagcaca tcaactgatg gagttccagc ttctgctgga 2940 
aataaaaaga aattgctcaa aaagaagaag ggtcaaaaga aaagcaaaac cggcgaatct 3000 
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gaagaacacg atgaagatag tacagttgaa gacgcaggag aagatacaat tgtagaaaa- 3060 

r.r.^^z.r^Y nn^ ^P^r.^iOT^osiac ataatatcQa qatqaaagar 3120 

gcttctaaag attctacacc ttccgtgcaa atttcgattg ctgagaaaga attgatag-- 3180 

tcgaagccag caacatgcga gctgattcag tgttgtgtcg aaaagtcgga gggttggnca 3240 

gtcagcgagt tggaacgttt gtcttctgtt ctatcacata caattgaacg attccgggac 3300 

gaatgaaacc gcgaaaatct accagcccaa cttacccaaa ttgtacgrga atggcaaaca 3360 

gctgacgatt cgaacaatac aattgttaat ggaaccctca acaaatcgaa tggaaaccrt 3420 
gccaarggac actaa 

<210> 24 
<211> 1144 
<212> PRT 

<213> Caenorhabditis elegans 

<400> 24 , 

M°t G'n Ala He Asp Glu Ser Lys Arg Asn Gin Lys Val Pro Pro Ala 

Lys Arg Lys Arg He Tyr Leu Ser Asp Glu Glu Glu Glu Asp Phe Ala 
20 25 30 

Glu Ala Ala His Val Glu Asn Thr Val Pro Glu Arg Ala Thr Arg Arg 
35 40 45 

Ser- Th^ Arg Arg Arg Ser Ser Met His Glu Glu Leu Gly Val Ser Glu 
50 55 60 

Gin Glu Glu Ser Pro Val Arg Arg Thr Arg Lys Ala Ala Lys Arg Leu 
65 -70 75 80 

Glv Ser Glu Gin Pro Glu Glu Asn Leu Ala Ala Asp Asp Pro Leu Pro 

85 90 95 

Met Glu Gly Gly Gly Glu He Val Leu Pro He Ala Glu He Asp Gly 
ICQ 105 110 

Met Ala Glu Gin Glu Asn Glu Asp Leu He Glu Lys He Gly Arg Glu 
115 120 125 

Glu Glu Glu Glu Gly Ala Glu Glu Asp Glu Gin Ser Gly Glu Lys Asp 
130 135 140 

Pro Glu Glu Glu Glu Asp Asp Ser Ser Asn Ala Glu Ser Ser Glu Glu 
145 150 155 loO 

Ser Ala Pro Arg Gin Tyr Ser Leu Arg Arg Arg Gin Pro Val Val 

165 170 175 

Gin °re Asn Ala Ser Glu Ala Arg Glu Asn Arg Arg Ala Arg Leu Glu 
180 185 190 

His His Arg Val Ala Asn Gin Asn Arg His His Arg Asn Arg Asn Giy 
195 200 205 

Ser Arg Arg Arg Arg Ser Asp Ser Asp Ser Asp Ser Asp Asp Met Val 
210 215 220 

Leu Pro Arg Pro Asp Lys Arg Gin Ser Arg Pro His Met His Asn Arg 
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225 230 235 240 

Gly Glu Arg Giu Arg Glv Arg Phe Met Pro He Asn Met Thr Glu Lys 
245 ^ 250 255 

Glu Leu Gin Ser Ala Gin His He Leu Met Asp Arg Met Arg Lys Thr 

26C 265 270 

Aso hla Glv Gin Gly Ala Ser Asp He Asp Pro Met Ser Val Asp Ser 
275 280 285 

Ser Val Glv Phe Asp Gin Val Gly Gly Leu Gly His His He Gin Ser 
290 ^ 295 300 

Leu Lvs Glu Val Val Leu Phe Pro Met Leu Tyr Pro Glu Val Phe Glu 
305 ' 310 315 320 

Lys Phe Arg He Asn Pre Pro Lys Gly Val Val Phe Tyr Gly Pro Pro 
325 330 335 

Gly Thr Glv Lys Thr Lr.^ Val Ala Arg Ala Leu Ala Asn Glu Cys Arg 
340 345 350 

Arg Gly Ala Asn Lvs Va^ Axa Phe Phe Met Arg Lys Gly Ala Asp Cys 
355 ' 360 365 

Leu Ser Lys Tro Val Glv Glu Ser Glu Arg Gin Leu Arg Leu Leu Phe 
370 ^ ' 375 380 

Asp Gin Ala Tvr Ala Met Arg Pro Ser He He Phe Phe Asp Glu He 
385 ' 390 395 400 

Asp Gly Leu Ala Pro Val Arg Ser Ser Lys Gin Asp Gin He His Ala 
405 410 415 

Ser He Val Ser Thr Leu Leu Ala Leu Met Asp Gly Leu Asp Gly Arg 
420 425 430 

Gly Glu Val Val Val He Gly Ala Thr Asn Arg Leu Asp Thr Leu Asp 
435 440 445 

Pro Ala Leu Arg Arg Pro Gly Arg Phe Asp Arg Glu Leu Arg Phe Ser 
450 455 460 

Leu Pro Asp Leu Asn Ala Arg Arg Gin He Leu Asp He His Thr Ser 
465 470 475 480 

Lys Tro Glu Glu Asn Lvs Pro He Pro Glu Thr Leu Asp Ala He Ala 
485 " 490 495 

Glu Arg Thr Ser Gly Tyr Cys Gly Ala Asp Leu Lys Phe Leu Cys Thr 
500 505 510 

Glu Ala Val Leu He Gly Leu Arg Ser Arg Tyr Pro His He Tyr Met 
515 520 525 

Cys Ser Glu Arg Leu Lys Leu Asp Val Ala Thr He Lys He Thr Ser 
530 - 535 540 
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r-in Hie Ph<. r.lv His Ala Met Arq Arg He Thr Pro Ala Ser Arg Arg 
i45 ^ 550 555 5bU 

z^sD Leu Thr He Pro Ser Arg Pro Leu Asp Glu Arg Thr Ser He Leu 

565 570 575 

Leu Gly Asp Thr Val Ser Asn Leu He Ser Leu Arg He Pro Gin Gly 
580 585 590 

-yr Arg Cys Val Glu Asn Ala Met Ala Thr Ala Ser Ser Glu Leu Glu 
595 600 605 

Gin val val Arg Ala Leu Glu Pro Asn Pro Thr Val Pro Ala He Arg 
610 615 620 

Leu Leu Leu Cys Gly Ser Glu Gin Leu Ala Asp Gly Gly Gin Thr Ser 
625 630 635 640 

Tyr Val Leu Pro Ala He Leu Ala Lys Leu Asp His Leu Pro Val Phe 

645 650 655 

Sor Leu Ser Val Ser Ser Leu Leu Thr Asp Gly Arg Pro Glu Glu Ala 
660 665 670 

Phe Ser Asn Ala He Gin Ser Ala Met Arg Ala Ser Ala Thr Gly Pro 
675 680 685 

Cys He Met Leu Leu Pro Ser He Asp Glu Trp He Lys Val He Pro 
690 695 "700 

Val Ser Val Gin His Met Leu He Thr Cys Leu Glu Ser Met Thr Gly 
705 710 -715 -720 

Phe Thr Pro He Leu Phe Leu Ser Thr Leu Asp Thr Ser Phe Glu Asp 

725 730 735 

Ala Pro Glu Tyr Val Thr Glu He Phe Arg His Ala Asn Cys He Thr 
740 745 750 

Leu Asn Pro Ser Arg Arg Thr He Arg Gin Lys Tyr Phe Glu His Val 
755 760 765 

He Glu Lys He Asn Thr Pro Pro Lys Val Phe Asp Pro Arg Leu Met 
770 775 780 

Arg Asp Arg Arg Phe Val Glu Phe Val Glu Pro Val Asp Pro Asp Glu 
785 790 795 800 

Ala Glu Asp Tyr Tyr Glu He He Glu Thr Pro He Cys Met Gin Asp 

805 810 815 

Tie Met Glu Lys Leu Asn Asn Cys Glu Tyr Asn His Ala Asp Lys Phe 
820 825 830 

Val Ala Asp Leu He Leu He Gin Thr Asn Ala Leu Glu Tyr Asn Pro 
835 840 845 
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Ser Thr Thr Lvs Asp Glv Lys Leu lie Arg Gin Mer Ala Asn Thr Leu 
850 ^ ^ 855 860 

Arg Asp Ala He Asp Asp Leu He Glu Cys Glu Leu Asp Giu Ser Phe 
865 ■ 870 875 880 

Val G'u Arg He Glu Thr val Ser Arg Met Leu Gin Asp Ala Gly Val 

885 890 895 

Thr Pro Thr Ser Asp Lys Leu Leu Thr Glu lie Pro Lys Gly Phe Ala 
900 905 910 

Arg Lys Lys Ala Trp Ser Met Thr Asn Ser Leu Ala Lys Glu He Glu 
915 920 925 

Gin Trp Thr Ser Giu Arg Glu Ala Glu Asn Gin Lys Met Leu Ser Lys 
930 935 940 

Leu Gly Val Ala Ala Pro Thr Leu Glu Leu Val Val Val Pro Val Glu 
945 950 955 960 

Asp Met Lys Ser Glu Glu Giv Thr Ser Thr Ser Thr Asp Gly Val Pro 

965 970 975 

Ala Ser Ala Gly Asn Lys Lvs Lys Leu Leu Lys Lys Lys Lys Gly Gin 
980 " 985 990 

Lys Lys Ser Lys Thr Gly Glu Ser Glu Glu His Asp Glu Asp Ser Thr 
995 1000 1005 

Val Glu Asp Ala Gly Glu Asp Thr He Val Glu Asn Leu Glu He Lys 
1010 1015 1020 

Lys Asn Gin Glu Thr Pro Asn Ser Glu His Asp He Glu Met Lys Asp 
1025 1030 1035 1040 

Ala Ser Lvs Asp Ser Thr Pro Ser Val Gin He Ser He Ala Glu Lys 
1045 1050 1055 

Glu Leu He Val Ser Lys Pro Ala Thr Cys Glu Leu He Gin Cys Cys 
1060 1065 1070 

Val Glu Lys Ser Glu Glv Trp Ser Val Ser Glu Leu Glu Arg Leu Ser 
1075 * 1080 1085 

Ser Val Leu Ser His Thr He Glu Arg Phe Arg Asp Glu Trp Asn Arg 
1090 1095 1100 

Glu Asn Leu Pro Ala Gin Leu Thr Gin He Val Arg Glu Trp Gin Thr 
1105 1110 1115 1120 

Ala Asp Asp Ser Asn Asn Thr He Val Asn Gly Thr Leu Asn Lys Ser 
1125 1130 1135 



Asn Gly Asn Leu Ala Asn Gly His 
1140 
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<210> 25 
<211> ^ 90S 
<212> DMA 

<213> Zaenorhabditis eiegans 



<400> 25 
gaagaa r aa g 
caatacag^c 
aarcgarggg 
cgaaar agar 
ccgagacctg 
ggtcga" zca 
atgga" "gaa 
gttga" -cgr 
aaagaagrrg 
ccgccaaaag 
gctctzgcga 
gctgarrgtT: 
caagca-ang 
gttcgtzct-L 
atggacggat 
actctr gate 
cctga^" rga 
aaaccgattc 
gatttgaaat 
atctacatgt 
catttrggac 
tcgagaccat 
tcgttgagaa 
gaactggaac 
cttctargtg 
attctagcaa 
gatggacgtc 
accggcccat 
tcagt -caac 
ttcctatcga 
agaca -gcaa 
gagcacgtra 



aggat.ga'cag 
zzcQCcqccq 
czcgzctzga 
crcgtcgaag 
ataaacgrca 
tgccgatcaa 
"CQagaaaaac 
cggtrggttr 
- cctttcccc 
gzg t cgta t L 
avigaa"cgccg 
tgagtaaar r; 
ccatgcgtcc 
cgaaacaaca 
r. a g a t: g g r r 
caqczczacz 
aLgcccga eg 
cgqaaacact 
LCLtqtiqcac 
guLCtgaaag 
aLgcaacgcG 
uggangaacg 
r tccacaagg 
aagttg ttag 
ga tctgagca 
aa c t^g g a cca 
cagaagaagc 
gtatcargcTi 
atatgcttat 
cgct tgacac 
acrg tatcac 
ttgagaaqat 



cagcaatgca 
tcaaccagtt 
acatcat egg 
acgaagcgat 
atcgaggcct 
tatgaccgaa 
tgatgctggt 
cgatcaagtt 
aatgctttat 
ctatgggccc 
aagaggagcc 
ggtcggagag 
aucaattatc 
r caaattcat 
a ggagaagtt 
aagaccagga 
acaaatttta 
cgatgctatt 
agaagcagtr 
gcrgaaactc 
gaggatcaca 
aact tcaatt 
a tatcgatgc 
ggcattggaa 
actcgcggac 
uctaccggtg 
attttcaaat 
tcttccatca 
cacatgtctg 
rtcgtttgaa 
t trgaatcct 
raacactcca 



gaaagttcag 
gtccagr tea 
gttgcaaane 
tcggatretg 
catatgcata 
aaagaattgc 
caaggtgeaa 
ggaggaetcg 
cctgaagtat 
ccaggaaccg 
aacaaagttg 
tctgaaagac 
ttttttgatg 
gcgagcatag 
gttgtaattg 
cgttttgata 
gatattcaca 
gcagagagaa 
cttattggat 
gacgttgcca 
cctgcttcca 
ttacttggag 
gtggaaaatg 
cctaatccaa 
ggtggacaaa 
ttctetcttt 
gcaattcagt 
arcgatgaat 
gaatcaatga 
gangcgcctg 
teccgccgaa 
ccaaaag rat 



aggaa tcaac 
a tgeatcaga 
aaaacagaca 
atagtgatga 
accgtggaga 
agtctgccca 
gtgacattga 
gtcatcatat 
ttgaaaagtt 
gaaaaactct 
cattctttat 
aacttcgact 
aaatcgatgg 
tgagtacatt 
gagctacaaa 
gggaactccg 
ctcecaaatg 
ctagtggata 
tgagatctcg 
ccattaaaan 
gacgagatct 
atactgtgag 
caatggcgac 
ccgtgcctgc 
cgtcttacgt 
ctgtttcaag 
cggcaatgcg 
ggattaaagt 
ctggtttcac 
aatatgtcac 
caattcgtca 
ttgatcca 



tgctccacgT: 60 
agctcgtgaa 120 
tcaccgcaae 180 
tatggtgct-L 240 
acgtgaacge 300 
acacattczg 360 
tccaatgagt: 420 
tcaatctcte 480 
cagaataaat 540 
tgttgcacgt 600 
gcgaaaagga 660 
attgtttgai: 720 
tttagctcca 780 
gttagcactg 840 
tcgattagar 900 
tttttccctt 960 
ggaggaaaac 1020 
ttgcggtgce 1080 
atatcctcat 1140 
cacttctgag 1200 
tacaattcea 1260 
caacttgatt 1320 
agcttcaagc 1380 
tattcgtctt 1440 
tcttcccgcc 1500 
tcttttaaca 1560 
tgcatcagca 1620 
cattcctgra 1680 
gccgattcct 1740 
ggaaatttnc 1800 
aaaatatt-c 1860 
1908 



<210> 26 
<211> 636 
<212> PRT 

<213> Caenorhabditis eiegans 



GlS^G-u^lu Glu ASP Asp Ser Ser Asn Ala Glu Ser Ser Glu Glu Ser 
1 " 5 10 

Thr Ala Pro Arg Gin Tyr Ser Leu Arg Arg Arg Gin Pro Val Val Gin 
20 25 30 

Phe A-n Ala Ser Glu Ala Arg Glu Asn Arg Arg Ala Arg Leu Glu His 
35 40 4^ 

His P-g val Ala Asn Gin Asn Arg His His Arg Asn Arg Asn Gly Ser 
50 55 SO 

Arg Arg Arg Arg Ser Asp Ser Asp Ser Asp Ser Asp Asp Met Val Leu 
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65 70 75 80 

oro Asp Lys Arg Gin Ser Arg Pro His Met His Asn Arg Gly 
85 90 95 



i 'J J- 



Glu Arg Gly Arg Phe Met Pro He Asn Met Thr Glu Lys Glu 
100 i05 110 

- Ser Ala Gin His He Leu Met Asp Arg Met Arg Lys Thr Asp 

115 120 125 

- ' • -'n Gly Ala Ser Asp He Asp Pro Met Ser Val Asp Ser Ser 

135 140 

- ^sp Gin Val Glv Glv Leu Gly His His He Gin Ser Leu 
.r ' ' * ^ 150 ^ ^ 155 160 

■- ^ * • Val Leu Phe Pro Met Leu Tyr Pro Glu Val Phe Glu Lys 
" 165 170 175 

1'^ Asn Pro Pro Lys Glv Val Val Phe Tyr Gly Pro Pro Gly 
180 ' 185 190 

• Th- Leu Val Ala Arg Ala Leu Ala Asn Glu Cys Arg Arg 

' ill- 200 205 

3lv A-^ As:-- Lys Val Ala Phe Phe Met Arg Lys Gly Ala Asp Cys Leu 



2 1 r 



215 220 



L-s '^-p Val Gly Glu Ser Glu Arg Gin Leu Arg Leu Leu Phe Asp 
225 *' " 230 235 240 

Zln ^la Tvr Ala Met Arg Pro Ser He He Phe Phe Asp Glu He Asp 
245 250 255 

'~-l^' Leu ?'a Pro Val Arg Ser Ser Lvs Gin Asp Gin He His Ala Ser 
260 265 270 

H^ Val Thr Leu Leu Ala Leu Met Asp Gly Leu Asp Gly Arg Gly 

27b 280 285 

Glu Val Val Val He Gly Ala Thr Asn Arg Leu Asp Thr Leu Asp Pro 
290 295 300 

:\ia L^u Arc ^rg Pro Gly Arg Phe Asp Arg Glu Leu Arg Phe Ser Leu 
305 ^ 310 315 320 

Pro P^^o Leu Asn Ala Arg Arg Gin He Leu Asp He His Thr Ser Lys 
325 330 335 

Trp Glu Glu Asn Lys Pro He Pro Glu Thr Leu Asp Ala He Ala Glu 
340 345 350 

Arg Tnr Ser Gly Tyr Cys Gly Ala Asp Leu Lys Phe Leu Cys Thr Glu 
355 360 365 

Ala Val Leu He Gly Leu Arg Ser Arg Tyr Pro His He Tyr Met Cys 
370 375 380 
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Ser Gin Arn T.Pn Lvs Leu Asd Val Ala Thr He Lys He Thr Ser Giu 
335 " ^ 390 " 395 400 

His Phe Glv His Ala Met Arg Arg He Thr Pro Ala Ser Arg Arg Asp 

405 410 415 

Leu Thr He Pro Ser Arg Pro Leu Asp Glu Arg Thr Ser He Leu Leu 
420 425 430 

Gly Asd Thr Val Ser Asn Leu He Ser Leu Arg He Pro Gin Gly Tyr 
435 440 445 

Arg Cys Val Glu Asn Ala Met Ala Thr Ala Ser Ser Glu Leu Glu Gin 
450 455 460 

Val Val Arq Ala Leu Glu Pro Asn Pro Thr Val Pro Ala He Arg Leu 
465 470 475 480 

Leu Leu Cys Glv Ser Glu Gin Leu Ala Asp Gly Gly Gin Thr Ser Tyr 

485 490 495 

Val Leu Pro Ala He Leu Ala Lys Leu Asp His Leu Pro Val Phe Ser 
500 505 510 

Leu Ser Val Ser Ser Leu Leu Thr Asp Gly Arg Pro Glu Glu Ala Phe 
515 520 525 

Ser Asn Ala He Gin Ser Ala Met Arg Ala Ser Ala Thr Gly Pro Cys 
530 535 540 

He Met Leu Leu Pro Ser He Asp Glu Trp He Lys Val He Pro Val 
545 550 555 560 

Ser Val Gin His Met Leu He Thr Cys Leu Glu Ser Met Thr Gly Phe 

565 570 575 

Thr Pro He Leu Phe Leu Ser Thr Leu Asp Thr Ser Phe Glu Asp Ala 
580 585 590 

Pro Glu Tyr Val Thr Glu He Phe Arg His Ala Asn Cys He Thr Leu 
595 600 605 

Asn Pro Ser Arg Arg Thr He Arg Gin Lys Tyr Phe Glu His Val He 
610 615 620 

Glu Lys He Asn Thr Pro Pro Lys Val Phe Asp Pro 
625 630 635 



<210> 27 
<211> 3024 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 27 

atggaacgac acggttcaga attacaagga gaagtggttt tcactacgac gagtactgcg 60 
aataacaatg tgaaacgagc cgtcgaatat ttcgaagatg ataatcaaga tgatgcacta 120 
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acatccactt catccggcaa cagtacacag 
gatgTitccgc cgccaccagt ggctccagaa 
tctgcgtccc ccgaacgtca attccttgat 
ttatcacaga gcacaaggct cgatgagacg 
acccctggga aaaacaacat ttcctccgca 
gacgatctra ctgatacagc tgccccggta 
gaggaacatc gccggcaacc acatcagaat 
caguztagag ggggaacrcL gtccggtaga 
aacgaaaacc agcrccgcga gaagaagctc 
tcctcczctz ccgaagaaat tgtggaagcc 
gctcaaattc cagaggagaa gcccaaaccg 
gacgccgacL trgacaattc cccaccactt 
gggctctcac cgcttgagga ggctcagaga 
ccgtcgaatg ratcgttggc agaggaggcg 
gcatcggata tragaagaga agaggaggaa 
gttccagttc cacggaagga tctggaagtt 
agaratcgga agrcaaggga aattgaggaa 
gaagtrgatg agaaggagga tattgatgcg 
tccgccgazc caaaatctca cacgccgatt 
aagcaagtgc ctgcaccagt tgtcgttgcr 
"ctccagtac catcaaggat cagtggaagg 
cggagtatga gaggaaaacg gaagacccgt 
catcagaaca trcggctccg tgctccagct 
gtcgaggaca ccgaagtcgt cgtcgagctc 
cgaagrgacg tcatctcgcg tgacgtgttc 
gagcacgtgt ncttcgggct ctcgttcctt 
catcaacggc ccgaaaagtt tgcaccgtcc 
aaggtgccat acgrgttgca cttacgattc 
aaaacagacg raacgatgaa cgagttgtat 
agaatccaac ccaaaagaga cgcagcattc 
tttggaaatc gaccgccacc ggtgatcacc 
aagaagnata gcucatttga agatcagtca 
gggcattacg cgggcacccg gatttctgag 
cgacacccag artttggagc gcatgtgcat 
cacggagcat crccgttcga tcctgatact 
cgtggaatat cgattratga gcagcaggga 
tggccgcaga cgcagaccct gcaattcgat 
gctcacgatg agcaaattga gagcacattt 
ttcgtgcgat Lcgccgcctc ccaacatcgt 
acgctccggc acgagaatac gattcaggcg 
atcccgccgg cgcctatacg gcaaggtctc 
gcctctgccg atcagatctt tgcgaaaatg 
cgacccgccg aactgccacc accggcaccg 
gttgatgaag gaattgtttg tgattcccaa 
gacaacggga acgtgacgcc acgtggcatg 
gccaacggtc tgggtctcac cctagtcgac 
gtgaagcttg rcgccgacaa tggagccgga 
caatatccct tttcctcggg gtgtacacta 
tcgattatct ctgaaaaatc gtaa 

<210> 28 
<211> 1007 
<212> PRT 

<213> Caenorhabditis elegans 



44 

aaggaarcgt ctccgttcac agattttgat 180 
actccggcrc cagcccaaaa ccgccgagaa 240 
gagagtcacc tcggcggaat tggatctccg 300 
tttatrgagg aatactcnat tgagttggac 360 
gcttcacctg gtcccaagtc tccatttgac 420 
gccccgccac cagcaccuac aaaagctgcc 480 
ccgtttgarg aggaagagga agaagaatca 540 
gacccacri:g argaagattc rggaaactcT: 600 
cacaaaaaag agcaactagc rcaccggctc 660 
tcaattcacg aggatgagcc aattgtgarg 720 
aaggctatcc cggcttttga taacgcctat 780 
catcattact cggcggttca tttggaaact 840 
gctctccgag ccaaccgagc aaggcaraag 900 
aagcttgctg ccagacagag atactcgaat 960 
gaagtggcgg aagaagatcc agcagtggta 1020 
gaggaggctc cgaaatcggt acgaccgcct 1080 
ccggtggtrg ttgatcggtt tgttgaggaa 1140 
atttttgaga aaratcgaaa gacgtctgta 1200 
ttgatggccg acgagtacaa agaaccacag 1260 
caggagccgc cgatctraaa acgaagaaac 1320 
cagtcgacgc ggcgaagcgt caccagtgtr 1380 
gcgattcccg agttcttcga cctaacccga 1440 
acgaaaaaga aacgcataag cctacaccgc 1500 
ctgaacggcc aaaaagttga agtggcatgc 1560 
tctcttatcg tccagaacat gaatatcaat 1620 
cgtgatggcg agcactactt tatagaggat 1680 
ggatggaaaa gtgtggcacg cgtgggcgtt 1740 
aaattttatc cgcagatttt ggatttcatc 1800 
ctacaatgta gacgagacgt actcgaagag 1860 
gaactcgccg cgctcgccct ccaagccgaa 1920 
gattatttcg atattcaaca ctacctgccc 1980 
cggctgaaga acattcrcgc cgagctgcac 2040 
gccaagcaca aatacattca aatctgtcag 2100 
cgtgtgttcc gaacgaagcc aacatcagct 2160 
gggtcatcgc tttggattgg aattatgcca 2220 
ggagcccgtg aggtgattgc cgagcatgtt 2280 
aagaagcgat ttgtgattgt tgccgttgga 2340 
tatacggacc atcactcgaa atcctcctat 2400 
tggatgatga agatgcgtca gtggaagagc 2460 
atgccagacg tgattgttga agggcagact 2520 
gaagaatcac ctccatcaac tccactgctc 2580 
tcagtgagcc aggaaaaacc ggcggccaac 2640 
tcgtcgaaat tcgcggctca atatgatacg 2700 
gctgaaaact ttgagcgcgt cgactcgact 2760 
caatttgaca ttttgctcgt gaaagacccc 2820 
gggaatctca acggagttcc aggagtgtat 2880 
atgaaagcgg taagaatrcg aaatttcagt 2940 
gagttgtgta aatctacaca taacgttttt 3000 

3024 



<400> 28 

Met Glu Arg His Gly Ser Glu Leu Gin Gly Glu Val Val Phe Thr Thr 
15 10 15 
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^-^^ ^Kv- asn Bsn Asn Val Lvs Arq Ala Val Glu Tyr Phe Glu 

^ "20 25 30 

^3- Asn Gin Asp AsD Ala Leu Thr Ser Thr Ser Ser Gly Asn Ser 

"" " 35 ' 40 45 

/ 

--- '. - -v= Glu Ser Ser Pre Phe Thr Asp Phe Asp Asp Val Pro Pro 

--- ---^ 55 60 

• • r.- 'ai Ala Pro Glu Thr Pro Ala Pro Ala Gin Asn Arg Arg Glu 
~ " 70 75 80 

- ■ Pro Glu Arg Gin Phe Leu Asp Glu Ser His Leu Gly Gly 

85 90 95 

^ Pro Leu Ser Gin Ser Thr Arg Leu Asp Glu Thr Phe lie 

100 105 110 

Ser He Glu Leu Asp Thr Ser Gly Lys Asn Asn He Ser 
-" ' . 120 12 5 



.- - _ r-- s°- Pro Gly Pro Lys Ser Pro Phe Asp Asp Asp Phe Thr 

ill ' " ' 135 140 

-.-r :. ::.la Pro Val Ala Pro Pro Pro Ala Pro Thr Lys Ala Ala 

V--^ " ■ " 150 155 160 

u G- ^v- Arq Arg Gin Pro His Gin Asn Pro Phe Asp Glu Glu Glu 

^ 165 170 175 



1.. Ser Gin Phe Gly Gly Gly Thr Leu Ser Gly Arg Asp Pro 

180 



185 190 



A" Giu Asp Ser Glv Asn Ser Asn Glu Asn Gin Leu Arg Glu Lys 
' -c' 200 205 



Lys Leu His 

21 



Lv^ Lys Glu Gin Leu Ala His Arg Leu Ser Ser Ser Ser 
215 220 



Glu G-- Val Glu Ala Ser He His Glu Asp Glu Pro He Val Met 

225 230 235 240 

-la H- Pro Glu Glu Lvs Pro Lys Pro Lys Ala He Pro Ala Phe 

245 ' 250 255 

Tvr Asp Ala Asp Phe Asp Asn Ser Pro Pro Leu His His 
'■ - " ' 260 265 270 

Tvr S-' Ala Val His Leu Glu Thr Gly Leu Ser Pro Leu Glu Glu Ala 
275 280 285 

Gin n-a A^a Leu Arg Ala Asn Arg Ala Arg His Lys Pro Ser Asn Val 
250 295 300 

ser T..- Ala Glu Glu Ala Lys Leu Ala Ala Arg Gin Arg Tyr Ser Asn 
305 310 315 320 
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Aia Ser Asd lie Arg Arg Glu Glu Glu Glu Glu Val Val Glu Giu Asp 

325 330 335 

^ro Ala Val Val Val Pro Val Leu Arg Lys Asp Leu Glu Val Glu Giu 
340 345 , 350 

Ala Pro Lys Ser Val Arg Pro Pro Arg Tyr Arg Lys Ser Arg Glu He 
355 360 365 

Glu Glu Pro Val Val Val Asp Arg Phe Val Giu Glu Giu Val Asp Glu 
370 375 380 

Lys Glu Asp He Asp Ala He Phe Glu Lys Tyr Arg Lys Thr Ser Val 
385 390 395 400 

Ser Aia Asp Pro Lys Ser His Thr Pro He Leu Met Ala Asp Glu Tyr 
405 410 415 

Lys Glu Pro Gin Lys Gin Val Pro Ala Pro Val Val Val Aia Gin Giu 
420 425 430 

Ser Pro He Leu Lys Arg Arg Asn Ser Leu Val Pro Ser Arg He Ser 
435 440 445 

Giy Arg Gin Ser Thr Arg Arg Ser Val Thr Ser Val Arg Ser Met Arg 
450 455 460 

Giy Lys Arg Lys Thr Arg Ala He Pro Glu Phe Phe Asp Leu Thr Arg 
465 470 475 480 

His Gin Asn He Arg Leu Arg Aia Pro Aia Thr Lys Lys Lys Arg He 

485 490 495 

Ser Leu His Arg Val Giu Asp Thr Glu Val Val Val Glu Leu Leu Asn 
500 505 510 

Giy Gin Lys Val Glu Val Aia Cys Arg Ser Asp Val He Ser Arg Asp 
515 520 525 

Val Phe Ser Leu He Val Gin Asn Met Asn He Asn Glu His Val Phe 
530 535 540 

Phe Giy Leu Ser Phe Leu Arg Asp Giy Glu His Tyr Phe lie Glu Asp 
545 550 555 560 

His Gin Arg Leu Glu Lys Phe Ala Pro Ser Giy Trp Lys Ser Val Aia 

565 570 575 

Arg Val Giy Val Lys Val Pro Tyr Val Leu His Leu Arg Phe Lys Phe 
580 585 590 

Tyr Pro Gin lie Leu Asp Phe He Lys Thr Asp Val Thr Met Asn Glu 
595 600 605 

Leu Tyr Leu Gin Cys Arg Arg Asp Val Leu Giu Glu Arg He Gin Pro 
610 615 620 



Lys 



Arg Asp Aia Aia Phe Giu Leu Aia Ala Leu Ala Leu Gin Aia Giu 
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635 640 



-be Glv Asn Arq Pro Pro Pro Val He Thr Asp Tyr Pne Asp lie Gin 
■ " ■ " 650 655 

■--^s TV- L<=u Pro Lys Lvs Tyr Ser Ser Phe Glu Asp Gin 3er Arg Leu 
650 ' 665 670 

r.s' -1- Leu Ala Glu Leu His Gly His Tyr Ala Gly Thr Arg He 
" ' 67 5 68 0 68 5 

==r G^u Ala Lys His Lys Tyr He Gin He Cys Gin Arg His Pro Asp 
590 695 "700 

-he Glv Ala His Val His Arg Val Phe Arg Thr Lys Pro Thr Ser Ala 
705 ' 710 715 

^■is Glv Ala ser Pro Phe Asp Pro Asp Thr Gly Ser Ser Leu Trp He 

725 730 '35 

-iv 7 1- M=r Pro Arg Gly He Ser He Tyr Glu Gin Gin Gly Gly Ala 
740 745 750 

^rq Glu val He Ala Glu His Val Trp Pro Gin Thr Gin Thr Leu Gin 
755 760 765 

-he Aso Lvs Lys Arg Phe Val He Val Ala Val Gly Ala His Asp Glu 
770 ' 775 780 

Gin He Glu Ser Thr Phe Tyr Thr Asp His His Ser Lys Ser Ser Tyr 
785 790 795 800 

Phe Val Arg Phe Ala Ala Ser Gin His Arg Trp Met Met Lys Met Arg 

805 810 815 

Gin Tro Lvs Ser Thr Leu Arg His Glu Asn Thr He Gin Ala Met Pro 
- ^ 820 825 830 

Asp Val He Val Glu Gly Gin Thr He Pro Pro Ala Pro He Arg Gin 
• ^ 835 840 845 

Gly Leu Glu Glu Ser Pro Pro Ser Thr Pro Leu Leu Ala Ser Ala Asp 
850 855 860 

Gin He Phe Ala Lys Met Ser Val Ser Gin Glu Lys Pro Ala Ala Asn 
865 870 875 880 

Arg Pro Ala Glu Leu Pro Pro Pro Ala Pro Ser Ser Lys Phe Ala Ala 

885 890 895 

Gin Tyr Asp Thr Val Asp Glu Gly He Val Cys Asp Ser Gin Ala Glu 
900 905 910 

Asn Phe Glu Arg Val Asp Ser Thr Asp Asn Gly Asn Val Thr Pro Arg 
915 920 925 

Gly Met Gin Phe Asp He Leu Leu Val Lys Asp Pro Ala Asn Gly Leu 
930 935 940 
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Gly Leu Thr Leu Val Asp Gly Asn 
945 950 

Vai Lys Leu Val Ala Asp Asn Gly 

965 

Arq Asn Phe Ser Gin Tyr Pro Phe 
980 

Cys Lys Ser Thr His Asn Val Phe 
995 1000 
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Leu Asn Gly Val Pro Gly Val Tyr 
955 960 

Ala Gly Met Lys Ala Val Arg lie 
970 975 

Ser Ser Glv Cys Thr Leu Glu Leu 
985 ' " 990 

Ser lie lie Ser Glu Lys Ser 
1005 



<210> 29 
<211> 1311 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 29 

aaaccgaagg ctatcccggc ttttgataac gcctatgacg ccgactttga caattcccca 60 

ccacttcatc attactcggc ggttcatttg gaaactgggc tctcaccgct tgaggaggct 120 

cagagagctc tccgagccaa ccgagcaagg cataagccgr cgaatgtatc gttggcagag 180 

gaggcgaagc trgctgccag acagagatac tcgaatgcat cggatattag aagagaagag 240 

gaggaagaag tggtggaaga agatccagca gtggtagttc cagttctacg gaaggatctg 300 

gaagttgagg aggctccgaa atcggtacga ccgcctagat atcggaagtc aagggaaatt 360 

gaggaaccgg tggttgttga tcggtttgtt gaggaagaag ttgatgagaa ggaggatatt 420 

gatgcgattt ttgagaaata tcgaaagacg tctgtatccg ccgatccaaa atctcacacg 480 

ccgattttga tggccgacga gtacaaagaa ccacagaagc aagtgcctgc accagttgtc 540 

gttgctcagg agtcgccgat cttaaaacga agaaactcrc tagtaccatc aaggatcagt 600 

ggaaggcagt cgacgcggcg aagtgtcacc agtgttcgga gtatgagagg aaaacggaag 660 

acccgtgcga ttcccgagtt cttcgaccta acccgacatc agaacattcg gctccgtgct 720 

ccagctacga aaaagaaacg cataagccta caccgcgtcg aggacaccga agtcgtcgtc 780 

gagctcctga acggccaaaa agttgaagtg gcatgccgaa gtgacgtcat ctcgcgtgac 840 

gtgttctctc ttatcgtcca gaacatgaat atcaatgagc acgtgttctt cgggctctcg 900 

ttccttcgtg atggcgagca ctactttata gaggatcatc aacggctcga aaagtttgca 960 

ccgtccggat ggaaaagtgt ggcacgcgtg ggcgttaagg tgccatacgt gttgcactta 1020 

cgattcaaar tttatccgca gattttggat ttcatcaaaa cagacgtaac gatgaacgag 1080 

trgtatctac aatgtagacg agacgtactc gaagagagaa tccaacccaa aagagacgca 1140 

gcattcgaac tcgccgcgct cgccctccaa gccgaatttg gaaatcgacc gccaccggtg 1200 

atcaccgatt atttcgatat tcaacactac ctgcccaaga agtatagctc atttgaagat 1260 
cagtcacggc tgaagaacat tctcgccgag ctgcacgggc attacgcggg c 1311 

<210> 30 
<211> 437 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 30 

Lys Pro Lys Ala lie Pro Ala Phe Asp Asn Ala Tyr Asp Ala Asp Phe 
15 10 15 

Asp Asn Ser Pro Pro Leu His His Tyr Ser Ala Val His Leu Glu Thr 

20 25 30 

Gly Leu Ser Pro Leu Glu Glu Ala Gin Arg Ala Leu Arg Ala Asn Arg 
35 40 45 
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Ala Arq His Lys Pro Ser Asn Vai Ser Leu Ala Glu Glu Ala Lys Leu 



CD 



Ala Ala Arq Gin Arg Tyr Ser Asn Ala Ser Asp He Arg Arg Glu Glu 
65 70 75 80 

Glu Glu Glu Val Val Glu Glu Asp Pro Ala Vai Val Val Pro Val Leu 

85 90 95 

Prg Lys Asp Leu Glu Val Glu Glu Ala Pro Lys Ser Val Arg Pro Pro 
100 105 110 

Arq Tvr Arq Lys Ser Arg Glu He Glu Glu Pro Val Val Val Asp Arg 
115 120 125 

Phe Val Glu Glu Glu Val Asp Glu Lys Glu Asp He Asp Ala He Phe 
130 135 140 

Glu Lys Tyr Arg Lys Thr Ser Val Ser Ala Asp Pro Lys Ser His Thr 
145 150 155 160 

Pro He Leu Met Ala Asp Glu Tyr Lys Glu Pro Gin Lys Gin Val Pro 

165 170 175 

Ala P^o Val Val Val Ala Gin Glu Ser Pro He Leu Lys Arg Arg Asn 
180 185 190 

Ser Leu Val Pro Ser Arg He Ser Gly Arg Gin Ser Thr Arg Arg Ser 
195 200 205 

Val Thr Ser Val Arg Ser Met Arg Gly Lys Arg Lys Thr Arg Ala He 
210 215 220 

Pro Glu Phe Phe Asp Leu Thr Arg His Gin Asn He Arg Leu Arg Ala 
225 230 235 240 

Pro Ala Thr Lys Lys Lys Arg He Ser Leu His Arg Val Glu Asp Thr 

245 250 255 

Glu Val Val Val Glu Leu Leu Asn Gly Gin Lys Val Glu Val Ala Cys 
260 265 270 

Arg Ser Asp Vai He Ser Arg Asp Val Phe Ser Leu He Val Gin Asn 
275 280 285 

Met Asn He Asn Glu His Val Phe Phe Gly Leu Ser Phe Leu Arg Asp 
290 295 300 

Gly Glu His Tyr Phe He Glu Asp His Gin Arg Leu Glu Lys Phe Ala 
305 310 315 320 

Pro S^r- Gly Trp Lys Ser Val Ala Arg Val Gly Val Lys Val Pro Tyr 

325 330 335 

Val Leu His Leu Arg Phe Lys Phe Tyr Pro Gin He Leu Asp Phe He 
340 345 350 

Lys Thr Asp Val Thr Met Asn Glu Leu Tyr Leu Gin Cys Arg Arg Asp 
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355 360 365 

Val Leu Glu Glu Arg lie Gin Pro Lys Arg Asp Ala Ala Phe Glu Leu 
370 375 380 

Ala Ala Leu Ala Leu Gin Ala Glu Phe Gly Asn Arg Pro Pro Pro Val 
335 390 395 400 

He Tnr Asp Tvr Phe Asp He Gin His Tyr Leu Pro Lys Lys Tyr Ser 
405 410 415 

Ser Phe Glu Asp Gin Ser Arg Leu Lys Asn He Leu Ala Glu Leu His 
420 425 430 

Gly His Tyr Ala Gly 
435 



<210> 31 
<211> 2574 
<212> DNA 

<213> Caenorhabdi t is elegans 
<400> 31 

atgcctgcta atgagctttu cggagattct gatccggaag gggatgaaat ttttgaaaga 60 
gttcgaaagg tgcaaccaaa gtcgatcaat gtaactgaga atcaaaaggt agaccctatg 120 
cgtaaagtca aaatcgagct tcaagctgtt ttggttgcgg aaaaaatacc aattagtaca 180 
gaagaaatca gacggcgtcr gctggattct tatggtgcat gtcctgatcc gaaaagatat 240 
aactgttcga ctttggacga tcttcttcaa gcatgctcgg agtcaattgt acacactttc 300 
ggccgtgatg gaatacaccg atatgggcca agaactactg aagccaacca agatattatc 360 
gaaatggtcc agcagcagtc gagctcaaaa cgcccggccc gctcgttttt aggttcagga 420 
gctaccaata acctcagcac rcatgggcca tcattccggg cattcagagg tccgtatgcg 480 
ccagaggaaa tcgctaaatc gagaggaaca cctgagcaat tcaaagcaag acacaagttg 540 
ggtccagcaa aaacaatttc tcgcgtaaaa aaccttgcag aggttttgaa agaatatgct 600 
gatgagatag gagtttcaca tcctgatgag ccaaatcgca agattgtaac actggcagct 660 
cttgccaata agctcaaaca gttgtattgr ttaccagcat ggggaaagaa catatcggaa 720 
agtgaactat acattcagcr caatgttcct cctttcaacg aatatctgca tttctggcgt 780 
cttagcgaaa aaggrgacat cttcgttgat tgtattgatc gtgacaatgc cgatccaact 840 
caaaaaagtg aacaaaatcc gtcagcagat gtttctattc aatctgaatc ttttggcggt 900 
aaaagttcag cttcagcgtt tgaacaatct gtagtatccg ctccttcaac tattagagat 960 
caaacatccg attcctttga cgggttcaac agtttcgaag tgcccccaga aaatggaagc 1020 
aaagattcaa aaattttcaa ctcgaatcaa gaaagcatcg atgactatcc aggaaatgct 1080 
atatcucgag atcgaaccgc tgatatgacc gacattgcat tgcgctttgg aactgtctct 1140 
gtggcaagcc aacaatgtcc ggtatcttcg tcactcgttc cacaaaatgg aattcttcgt 1200 
cagtcgcgtg ctcaagaaga cgacaacaac acatctartc taactattca atcatctcgt 1260 
cgcaatcatt cagtgcttcg tcatcgtacg atcaagcctc gcaatccaac acaaaatctt 1320 
gctgaagtcg taaaaactca tggtagcatt ccttatgaag cgctttcgga ttgtgataag 1380 
attatcgtcg acttaggaaa gaacattttc aaagtttatg caactcaacc tggagaaatg 1440 
atggtccgcc tttgtgatcc ccacgttgac acgactacat tgccactcct agagaacaat 1500 
cttcgggatc ctgtcgagtc tgatttacgt tggatgacac tgggaaattc ccatatcaag 1560 
aaacaatctg ttaaagtggt caagcctgca atgtttattg cgccacgcgg atttttgttg 1620 
attttaaaag atgaagaacg tgaagaaatg gacgttgaaa aggttgctac tatggggaat 1680 
atattgcggg ccgtgatggt agcgcctatt gtggagctcc agcgtgaaac cgtccgcacg 1740 
ggatcagctg cagtctacgt ttatcgacaa ggagcggagg tgcgctatta tagagtgttg 1800 
atagttggac aagctaaaca agatggtgaa gtgttggttt tgttggctga tgtcgatgat 1860 
caatatttcg tggatgtaca tctatctcat ttgttcccga ttccggaaga agccagcttc 1920 
aaacactttc cctcgaatgt tgtattcgca acattgcatg gagtactggg tttgacgctt 1980 
tctgagcagg atgtgatgtt cgaaaacatt gataatgatg atacaaaacg atttgtgggt 2040 
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ggatattttc atggaaacga tgacagaatt cttaacatcg atatggtttg gaaaaatgag 2100 

. ., 4- ^. 4_„„^t- t- ^ -.i- f. ni-r-a;5rf3a3r at.aaaacaat tacttcatcc 2160 

gatgccaaca tcattcactt tccacattct gcactcgatg tgataaagtc tgttggacca ZZZO 
gattgttcgg tgtgctttgt cgactattca gttcgtgatg aatctgcaac atcttcattg 2280 
atggaatcga ctagaattgt tcatgatagt cgtgaatcta tgacaactac ttatgttggt 2340 
gagat-gccaa gcccaattat cgaagaaatc gatgcaacat cttcatttga cccaaaactc 2400 
tcgaatctcc attcattgtt cgataagtta atcgaggaac agaatgctac tatgattgtg 24o0 
gggatgttcc aatccgttcg aagtttgaag gatttattcg gcgataacaa rgaatgggaa 2520 
cgcttgctca cgtacatgct cacgactggt aaaaacaata acattcgtct ctga 2574 

<210> 32 
<211> 857 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 32 

Met Pro Ala Asn Glu Leu Phe Gly Asp Ser Asp Pro Glu Gly Asp Glu 
15 10 15 

He Phe Glu Arg Val Arg Lys Val Gin Pro Lys Ser He Asn Val Thr 
20 25 30 

Glu Asn Gin Lys Val Asp Pro Met Arg Lys Val Lys He Glu Leu Gin 
35 40 45 

Ala Val Leu Val Ala Glu Lys He Pro He Ser Thr Glu Glu He Arg 
50 55 60 

Arq "Arg Leu Leu Asp Ser Tyr Gly Ala Cys Pro Asp Pro Lys Arg Tyr 
65 70 75 80 

Asn Cys Ser Thr Leu Asp Asp Leu Leu Gin Ala Cys Ser Glu Ser He 

85 90 95 

Val His Thr Phe Gly Arg Asp Gly He His Arg Tyr Gly Pro Arg Thr 
100 105 110 

Thr Glu Ala Asn Gin Asp He He Glu Met Val Gin Gin Gin Ser Ser 
115 120 125 

Ser Lys Arg Pro Ala Arg Ser Phe Leu Gly Ser Gly Ala Thr Asn Asn 
130 135 140 

Leu Ser Thr His Gly Ser Ser Phe Arg Ala Phe Arg Gly Pro Tyr Ala 
145 150 155 160 

Ser Glu Glu He Ala Lys Ser Arg Gly Thr Pro Glu Gin Phe Lys Ala 

165 170 175 

Arg His Lys Leu Gly Pro Ala Lys Thr He Ser Arg Val Lys Asn Leu 
180 185 190 

Ala Glu Val Leu Lys Glu Tyr Ala Asp Glu He Gly Val Ser His Pro 
195 200 205 

Asp Glu Pro Asn Arg Lys He Val Thr Leu Ala Ala Leu Ala Asn Lys 
210 215 220 
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Phe Lys Gin Leu Tyr Cys Leu Pro Ala Trp Gly Lys Asn He Ser Glu 
225 230 235 240 

Ser Glu Leu Tvr He Gin Leu Asn Val Pro Pro Phe Asn Glu Tyr Leu 
245 250 255 

"is ^h^ Trp Ara Leu Ser Glu Lys Gly Asp He Phe Val Asp Cys He 
260 265 270 

Asp Arg Asp Asn Ala Asp Pro Thr Gin Lys Ser Glu Gin Asn Pro Ser 
275 280 285 

Ala Asp Val Ser 'He Gin Ser Glu Ser Phe Gly Gly Lys Ser Ser Ala 
290 295 300 

Ser Ala Phe Glu Gin Ser Val Val Ser Ala Pro Ser Thr He Arg Asp 
305 310 315 320 

Gin Thr Ser Asp Ser Phe Asp Gly Phe Asn Ser Phe Glu Val Pro Pro 
325 330 335 

Glu Asn Gly Ser Lys Asp Ser Lys He Phe Asn Ser Asn Gin Glu Ser 
340 345 350 

He Asp Asp Tyr Pro Gly Asn Ala He Ser Arg Asp Arg Thr Ala Asp 
355 360 365 

Met Thr Asp He Ala Leu Arg Phe Gly Thr Val Ser Val Ala Ser Gin 
370 375 380 

Gin Cys Pro Val Ser Ser Ser Leu Val Pro Gin Asn Gly He Leu Arg 
385 390 395 400 

Gin Ser Arg Ala Gin Glu Asp Asp Asn Asn Thr Ser He Leu Thr He 

405 410 415 

Gin Ser Ser Arg Arg Asn His Ser Val Leu Arg His Arg Thr He Lys 
420 425 430 

Pro Arg Asn Pro Thr Gin Asn Leu Ala Glu Val Val Lys Thr His Gly 
435 440 445 

Ser He Pro Tyr Glu Ala Leu Ser Asp Cys Asp Lys He He Val Asp 
450 455 460 

Leu Gly Lys Asn He Phe Lys VHl Tyr Ala Thr Gin Pro Gly Glu Met 
465 470 475 480 

Met Val Arg Leu Cys Asp Pro His Val Asp Thr Thr Thr Leu Pro Leu 

485 490 495 

Leu Glu Asn Asn Leu Arg Asp Pro Val Glu Ser Asp Leu Arg Trp Met 
500 505 510 

Thr Leu Gly Asn Ser His He Lys Lys Gin Ser Val Lys Val Val Lys 
515 520 525 

Pro Ala Met Phe He Ala Pro Arg Gly Phe Leu Leu He Leu Lys Asp 
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530 535 540 

Glu Glu Arg Glu Glu Met Asp Val Glu Lys Val Ala Thr Met Gly Asn 
545 " 550 555 560 

He Le'-- a^a Ala Val Met Val Ala Pro He Val Glu Leu Gin Arg Glu 

565 570 575 

n^hr V^l '^ra Thr Glv Ser Ala Ala Val Tyr Val Tyr Arg Gin Gly Ala 
- " " 580 ' 585 590 

Glu Val Arg Tvr Tyr Arg Val Leu He Val Gly Gin Ala Lys Gin Asp 
595 ' 600 605 

Gly Glu Val Leu Vai Leu Leu Ala Asp Val Asp Asp Gin Tyr Phe Val 
610 615 620 

ASD val His L-u Ser His Leu Phe Pro He Pro Glu Glu Ala Ser Phe 
625 630 635 640 

Lvs His Phe P'-c Ser Asr. Val Val Phe Ala Thr Leu His Gly Val Leu 

645 650 655 

Gly Leu Th- Leu Ser Giu Gin Asp Val Met Phe Glu Asn He Asp Asn 
660 665 670 

Asp Asp Th- Lvs Ara Phe Val Gly Gly Tyr Phe His Gly Asn Asp Asp 
675 ' 680 685 

Arg He Leu Asn He Asd Met Val Trp Lys Asn Glu Arg Gly Gin Phe 
690 695 700 

Glu TrD Leu Ser Gin He Val Lys Arg Arg Gly Ala Val Thr Ser Ser 
705 ' -^lO 715 720 

Asp Ala Asn He He His Phe Pro His Ser Ala Leu Asp Val He Lys 

725 730 735 

Ser Val Gly Pro Asd Cvs Ser Val Cys Phe Val Asp Tyr Ser Val Arg 
740 745 750 

2\sp Glu Ser Ala Thr Ser Ser Leu Met Glu Ser Thr Arg He Val His 
755 760 765 

Asp Ser Arg Glu Ser Met Thr Thr Thr Tyr Val Gly Glu Met Pro Ser 
770 775 780 

Pro -1» He Glu Glu He Asp Ala Thr Ser Ser Phe Asp Pro Lys Leu 
785 790 795 800 

Leu Asn Leu His Ser Leu Phe Asp Lys Leu He Glu Glu Gin Asn Val 

805 810 815 

Thr Met He Val Gly Met Phe Gin Phe Val Arg Ser Leu Lys Asp Leu 
820 825 830 

Phe Giv Asp Asn Asn Glu Trp Glu Arg Leu Leu Thr Tyr Met Leu Thr 
835 840 845 
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Thr Glv Lys Asn Asn Asn He Arg Leu 
850 ' 855 



<210> 33 



r/ionorhabdi tis eieaans 



u 3 



gagttcgaaa ggtigcaacca aagtcgarca atgtaactga gaarcaaaag 60 

tgcgtaaagt caaaarcgag ctrcaagctg ttttggtrgc ggaaaaaata 120 

cagaagaaat cagacggcgt ctgctggatt cttatggtgc atgrcctgat 180 

ataactgttc gactttggac gatcttcttc aagcatgctc ggag-caari: 240 

tcggccgtga tggaatacac cgatatgggc caagaactac tgaagccaac 300 

::cgaaatggt ccagcagcag rcgagctcaa aacgcccggc ccgcrcgrtt 360 

gagcractaa taacctcagc actcatgggt carcattccg ggcartcaga 420 

cgtcagagga aatcgcnaaa tcgagaggaa cacctgagca attcaaagca 480 

^.gggtccagc aaaaacaatt tcrcgcgtaa aaaaccttgc agaggttttg 540 

ctgatgagar. aggagtttca catcctgatg agccaaatcg caagattgra 600 

ctcttgccaa taagtrcaaa cagttgtatt gtttaccagc atggggaaag 660 

aaagtgaact atacattcag ctcaatgttc ctcctttcaa cgaaTiatctg 720 

z :^*^zzzz:]zz qtcttagcga aaaaggtgac atcttcgttg attgtattga tcgtgacaat 780 

rccgai-ccc: ctcaaaaaag tgaacaaaat ccgtcagcag atgtttictat tcaatctgaa 840 

tcitttggcg gtaaaagttc agcttcagcg tttgaacaat ctgtagtatc cqczccttca 900 

azzaztaqac, atcaaacatc cgartccttt gacgggttca acagtttcga agtgccccca 960 

gaaaarggaa gcaaagattc aaaaattttc aactcgaatc aagaaagcat cgatgactat 1020 

ccaggaaaig cratatctcg agatcgaacc gctgatatga ccgacattgc attgcgcttt 1080 

qqaazzqzcz ctgtggcaag ccaacaatgt ccggtatctt cgtcactcgt tccacaaaat 1140 

ggaatrcttc gtcagtcgcg tgctcaagaa gacgacaaca acacatctat tctaactatt 1200 

caatcacctc gtcgcaatca ttcagtgctt cgtcatcgta cgatcaagcc tcgcaatcca 1260 

acacaaaacc trgctgaagt cgtaaaaacr catggtagca ttccttatga agcgctttcg 1320 

cattgtgaia agartatcgt cgacttagga aagaacattt tcaaagttta tgcaactcaa 1380 

cctggagaaa tgatggtccg cctrtgtgar ccccacgttg acacgactac atrgccactc 1440 

ctagacaaca atcttcggga tcctgtcgag tctgatttac gttggatgac actgggaaat 1500 

zcccatazca agaaacaatc tgrtaaagtg gtcaagcctg caatgtttat cgcgccacgc 1560 
ggattittcr tgattttaaa agatgaa 1587 

<210> 34 
<211> 529 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 34 

lie Phe Giu Arg Val Arg Lys Val Gin Pro Lys Ser He Asn Val Thr 
15 10 15 

Giu Asn Gin Lys Val Asp Pro Met Arg Lys Val Lys He Giu Leu Gin 
20 25 30 

Ala Val Leu Val Ala Glu Lys He Pro He Ser Thr Glu Glu He Arg 
35 40 45 

Arg Arg Leu Leu Asp Ser Tyr Gly Ala Cys Pro Asp Pro Lys Arg Tyr 
50 55 60 



Asn Cys Ser Thr Leu Asp Asp Leu Leu Gin Ala Cys Ser Glu Ser He 
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65 70 -75 80 

Val His Thr Phe Gly Ara Asp Gly He His Arg Tyr Gly Pro Arg Thr 

85 ' 90 55 

Th^ Glu Ala Asn Gin Asp He He Glu Met Val Gin Gin Gin Ser Ser 
100 105 110 

Ser Lvs Arg Pro Ala Arg Ser Phe Leu Gly Ser Gly Ala Thr Asn Asn 
115 120 125 

Leu Ser Thr His Gly Ser Ser Phe Arg Ala Phe Arg Gly Pro Tyr Ala 
130 135 140 

S°r Glu Glu He Ala Lys Ser Arg Gly Thr Pro Glu Gin Phe Lys Ala 
145 150 155 160 

Arq His Lys Leu Gly Pro Ala Lys Thr He Ser Arg Val Lys Asn Leu 

165 170 175 

Ala Glu Val Leu Lys Glu Tvr Ala Asp Glu He Gly Val Ser His Pro 
leo 185 190 

Asp Glu Pro Asn Arg Lys He Val Thr Leu Ala Ala Leu Ala Asn Lys 
195 200 205 

Phe Lys Gin Leu Tyr Cys Leu Pro Ala Trp Gly Lys Asn He Ser Glu 
210 215 220 

Ser Glu Leu Tyr He Gin Leu Asn Val Pro Pro Phe Asn Glu Tyr Leu 
225 230 235 240 

His Phe Trp Arg Leu Ser Glu Lys Gly Asp He Phe Val Asp Cys He 

245 250 255 

Asp Arg Asp Asn Ala Asp Pro Thr Gin Lys Ser Glu Gin Asn Pro Ser 
260 265 270 

Ala Asp Val Ser He Gin Ser Glu Ser Phe Gly Gly Lys Ser Ser Ala 
275 280 285 

ser Ala Phe Glu Gin Ser Val Val Ser Ala Pro Ser Thr He Arg Asp 
290 295 300 

Gin Thr Ser Asp Ser Phe Asp Gly Phe Asn Ser Phe Glu Val Pro Pro 
305 310 315 320 

Glu Asn Gly Ser Lys Asp Ser Lys He Phe Asn Ser Asn Gin Glu Ser 

325 330 335 

He Asp Asp Tyr Pro Gly Asn Ala He Ser Arg Asp Arg Thr Ala Asp 
340 345 350 

Met Thr Asp He Ala Leu Arg Phe Gly Thr Val Ser Val Ala Ser Gin 
355 360 365 

Gin Cys Pro Val Ser Ser Ser Leu Val Pro Gin Asn Gly He Leu Arg 
370 375 380 
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Gin Ser Arg Ala Gin Glu Asp Asp Asn Asn Thr Ser lie Leu Thr lie 
385 390 395 400 

Gin Ser Ser Arg Arg Asn His Ser Vai Leu Arg His Arg Thr lie Lys 
405 410 415 

Pro Arg Asn Pro Thr Gin Asn Leu Ala Glu Vai Val Lys Thr His Gly 
420 425 430 

Ser lie Pro Tyr Glu Ala Leu Ser Asp Cys Asp Lys lie He Val Asp 
435 ' 440 445 

Leu Gly Lys Asn lie Phe Lys Val Tyr Ala Thr Gin Pro Gly Glu Met 
450 455 460 

Met Val Arg Leu Cys Asp Pro His Val Asp Thr Thr Thr Leu Pro Leu 
465 " 470 475 480 

Leu Glu Asn Asn Leu Arg Asp Pro Vai Glu Ser Asp Leu Arg Trp Met 
485 490 495 

Thr Leu Gly Asn Ser His He Lys Lys Gin Ser Val Lys Vai Vai Lys 
500 505 510 

Pro Ala Met Phe He Ala Pro Arg Gly Phe Leu Leu lie Leu Lys Asp 
515 520 525 

Glu 



<210> 35 
<211> 1593 
<2i2> DNA 

<213> Caenorhabdit i s eiegans 
<400> 35 

atgcgcattg tccgtacaca ccgcgacgag tttttacgga cattgtgctt gaacttgttt 60 
tgttgtctgt tgatcaactc aattgagaaa tcaaaacaaa ttcaatcttc agcgtatttt 120 
tttcgaaatt ctcactcttt cgccatcgaa aagttcaaaa gaaaacaaca aaaaatgcct 180 
cgcggtctac ggagagcaga tttagtcaaa cgacatcgcc actcaacgac aggagacaaa 240 
gacggaggag taccagaagt aataggatgc ccagttttag atcctattat ctgccaatgt 300 
ccaaaagatg agatcgagct tggtgaagga gtcaagatga cgtgcacttg ggaatcatgc 360 
ccgtactcta gtagaccact tcatcacata tgctatcaac tgctcgagga caatcttgtc 420 
aagcgattag cctcactggg aagtgcacga ggatggacag tgccacaacg gaggaataac 480 
ttatgggaga ggaagggtca gtccctgatc ggaaagttct gccgatgtcg ctgcgatcgg 540 
ggacaaatga ccagagacaa gcaggcttta tatgagaaag agaaggctgt ggaaaaagag 600 
aagaagaaga aggccaagaa agcaaaacaa ctgccccagc tacaatttaa ttctaaacct 660 
trggcagcta tcgaggagaa aaagcgagga gacgctgatg tattccactc accgtccatt 720 
gcctcaagta cacggcatca cacattctcg acgacgacac gatcgcgact tcatactgat 780 
cgttcggctt cttccatttt aacacacact attggaagaa cgtggtccga atcttcgttt 840 
gccggtgaaa caaatggtca gtacgacaac aatcaggagc cacatccatc aaattgtgaa 900 
tgcgtatttc atcacgatta cgacgctgac gatcaaatag atacggattt cgagtgtgaa 960 
agcaatcaca gcgacgtaat agttccagct ccacttccac cacttcaggc gaaaagctat 1020 
gcagcgacaa taatgagaaa cgggacaccg aaggttacaa attattcacc ggatagtggt 1080 
ctcgatcagc aaactccaag gttttcattg tcttcttcga gtggaggaga tgtcgataat 1140 
caacatggag acttccacgt ggaaactaga atttccgagc atctcaacgc gttgggactc 1200 
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agcataa-.gt cgccggtgga gaatgcgaat gaaaatgtca attatgaaga atcgccgt-.c 1260 

^r-r-^^rrnrr. tcaaaaaaqc aqcqqqaacc tctccgagcg iJ^U 

. - - - - - - , _ _ 

1440 
1500 



aaaaacaaca catctgtctc gaagcttcca cttgctccgt cgtcacagct attcaatga=. 

gaarcaca-t gtggattcag attcaatgtg ccggttcgcg aaatgatgga catatggcaa 

q^q-'-cc;gaa ccttgtcgcc ggcaattcga gaaacacagg ctgaaaatac tgaaaaaaga 

aczgacaatg cgtcgggtgt actccaatat ggatggactc cattcttcgg caatggctrc 1560 
cjitcu ;::gag agcgcctcca c'cs.cT.^ocoa Tiag 159,^ 



■ ' ■ ■ . r t^ T 

: : :• ■ raonorhabditis elegans 

- .^ val Arg Thr His Arg Asp Glu Phe Leu Arg Thr Leu Cys 
■ : •" ■ 5 10 15 

.. .-. ■ ■-• Phe Cys Cys Leu Leu He Asn Ser He Glu Lys Ser Lys 
" 20 25 3D 

• - ' Ser Ser Ala Tyr Phe Phe Arg Asn Ser His Ser Phe Ala 

35 ' 40 45 

- G _ -vs Phe Lys Arg Lys Gin Gin Lys Met Pro Arg Gly Leu Arg 
" W 55 60 

::,.c Leu Val Lys Arg His Arg His Ser Thr Thr Gly Asp Lys 
"65 ' 70 "75 80 

-■s- G' -- G"--- Val Pro Glu Val He Glv Cys Pro Val Leu Asp Pro He 
" ' 85 90 95 

He C--=^ Gin Cvs Pro Lys Asp Glu He Glu Leu Gly Glu Gly Val Lys 
100 105 110 

f.,^. .^,.3 -i-hr Trp Glu Ser Cys Pro Tyr Ser Ser Arg Pro Leu His 

115 " 120 125 

H's II- Cvs Tvr Gin Leu Leu Glu Asp Asn Leu Val Lys Arg Leu Ala 
li:- ' ^ 135 140 

3-r L=-- Glv Se- Ala Arg Glv Trp Thr Val Pro Gin Arg Arg Asn Asn 
145 " ' 150 ' 155 160 

L-u -'c Glu Arg Lys Gly Gin Ser Leu He Gly Lys Phe Cys Arg Cys 

165 170 I'^S 

i^-n C-.-= -.so Arg Gly Gin Met Thr Arg Asp Lys Gin Ala Leu Tyr Glu 
' ' 180 185 190 

Lys Glu Lvs Ala Val Glu Lys Glu Lys Lys Lys Lys Ala Lys Lys Ala 
195 200 205 

Lys G-^n Leu Pro Gin Leu Gin Phe Asn Ser Lys Pro Leu Ala Ala He 
2:0 215 220 

Glu Glu Lys Lys Arg Gly Asp Ala Asp Val Phe His Ser Pro Ser He 
225 ~ 230 235 240 
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Ala Ser Ser Thr Arg His His Thr Phe Ser Thr Thr Thr Arg Ser Arg 
245 250 255 

Leu His Thr Asp Arg Ser Ala Ser Ser lie Leu Thr His Thr lie Gly 
260 265 270 

Arg Thr Trp Ser Glu Ser Ser Phe Ala Gly Glu Thr Asn Gly Gin Tyr 
275 280 285 

Asp Asn Asn Gin Glu Pro His Pro Ser Asn Cys Glu Cys Val Phe His 
290 295 300 

His Asp Tyr Asp Ala Asp Asp Gin lie Asp Thr Asp Phe Glu Cys Glu 
305 310 315 320 

Ser Asn His Ser Asp Val lie Val Pro Ala Pro Leu Pro Pro Leu Gin 
325 330 335 

Ala Lvs Ser Tyr Ala Ala Thr He Met Arg Asn Gly Thr Pro Lys Val 
340 345 350 

Thr Asn Tyr Ser Pro Asp Ser Gly Leu Asp Gin Gin Thr Pro Arg Phe 
355 360 365 

Ser Leu Ser Ser Ser Ser Gly Gly Asp Val Asp Asn Gin His Gly Asp 
370 375 380 

Phe His Val Glu Thr Arg He Ser Glu His Leu Asn Ala Leu Gly Leu 
385 390 395 400 

Ser He Met Ser Pro Val Glu Asn Ala Asn Glu Asn Val Asn Tyr Glu 
405 410 415 

Glu Ser Pro Phe Tyr Pro Glu Leu Thr Ser Thr Pro He Val Ser Lys 
420 425 430 

Lys Gin Arg Glu Pro Leu Arg Ala Lys Lys Ser Thr Ser Val Ser Lys 
435 440 445 

Leu Pro Leu Ala Pro Ser Ser Gin Leu Phe Asn Glu Glu Ser Arg Cys 
450 455 460 

Gly Phe Arg Phe Asn Val Pro Val Arg Glu Met Met Asp He Trp Gin 
465 470 475 480 

Glu Ser Gly Ala Leu Ser Pro Ala He Arg Glu Thr Gin Ala Glu Asn 

485 490 495 

Thr Glu Lys Arg Ala Glu Asn Ala Ser Gly Val Leu Gin Tyr Gly Trp 
500 505 510 

Thr Pro Phe Phe Gly Asn Gly Phe Asn Leu Gly Glu Arg Leu Tyr Tyr 
515 520 525 



Phe Pro 
530 
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<210> 3 "7 
<211> 1458 
<212> DNA 

<213> Caenorhabditis elegans 



30 



<400> 37 

--ttucgcca tcgaaaagtt caaaagaaaa caacaaaaaa tgcctcgcgg tctacggaga 
qcagatttag tcaaacgaca tcgccactca acgacaggag acaaagacgg aggagtacca 120 
gaagtaacag gatgcccagt tttagatcct attatctgcc aatgtccaaa agatgagatc 180 
aagcttggtg aaggagtcaa gatgacgtgc acttgggaat catgcccgta ctctagtaga 240 
ccacttcatc acatatgcta tcaactgctc gaggacaatc ttgtcaagcg attagcctca 300 
ctgggaagtg cacgaggatg gacagtgcca caacggagga ataacttatg ggagaggaag 360 
agtcagcccc tgatcggaaa gttctgccga tgtcgctgcg atcggggaca aatgaccaga 420 
aacaaqcagg ctttatatga gaaagagaag gctgtggaaa aagagaagaa gaagaaggcc 480 
aagaaaccaa aacaactgcc ccagctacaa tttaattcta aacctttggc agctatcgag 540 
gagaaaaagc gaggagacgc tgatgtattc cacccaccgt ccattgcctc aagtacacgg 600 
catcacacac tctcgacgac gacacgatcg cgacttcata ctgatcgttc ggcttcttcc 660 
artttaacac acactattgg aagaacgtgg cccgaacctt cgtttgccgg tgaaacaaat 720 
caataca acaacaatca ggagccacat ccatcaaatt gtgaatgcgt atttcarcac 780 



aatcaigacg ccgacgatca aatagatacg gatttcgagt gtgaaagcaa tcacagcgac 840 
ctaatagtcc cagctccact tccaccactt caggcgaaaa gctatgcagc gacaataatg 900 
agaaacggga caccgaaggt tacaaattat tcaccggata gtggtcrcga tcagcaaact 960 



ccaaggttct cattgtcttc ttcgagtgga ggagatgtcg ataatcaaca tggagacttc 1020 

cacgtggaaa ctagaatttc cgagcatctc aacgcgttgg gactcagcat aatgtcgccg 1080 

gtggagaatg cgaatgaaaa tgtcaattat gaagaatcgc cgttctaccc ggagctgaca 1140 

tcgactccaa ccgtctcgaa gaagcagcgg gaacctctcc gagcgaaaaa gagcacatct 1200 

gtctcgaagc trccacttgc tccgtcgtca cagctattca atgaagaatc gcgttgtgga 1260 

ttcagattca atgtgccggt ncgcgaaatg atggacatat ggcaagagtc tggagccttg 1320 

tcgccggcaa ttcgagaaac acaggctgaa aatactgaaa aaagagctga gaatgcgtcg 1380 

ggtgtactcc aatatggatg gactccattc ttcggcaatg gcttcaatct cggagagcgc 1440 

ctctacract tcccatag 

<210> 38 
<211> 485 
<212> PRT 

<213> Caenorhabditis elegans 

<400> 38 ^ 

Ser Phe Ala lie Glu Lys Phe Lys Arg Lys Gin Gin Lys Me. Pro Arg 
15 10 15 

Gly Leu Arg Arg Ala Asp Leu Val Lys Arg His Arg His Ser Thr Thr 



20 



25 30 



Gly Asp Lys Asp Gly Gly Val Pro Glu Val He Gly Cys Pro Val Leu 



35 



40 45 



Asp Pro He He Cys Gin Cys Pro Lys Asp Glu He Glu Leu Gly Glu 



50 



55 60 



Gly Val Lys Met Thr Cys Thr Trp Glu Ser Cys Pro Tyr Ser Ser Arg 
65 70 75 80 

Pro Leu His His He Cys Tyr Gin Leu Leu Glu Asp Asn Leu Val Lys 

85 90 95 

Arg Leu Ala Ser Leu Gly Ser Ala Arg Gly Trp Thr Val Pro Gin Arg 
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100 105 110 

Arg Asn Asn Leu Trp Glu Arg Lys Gly Gin Ser Leu He Gly Lys Phe 
115 120 125 

Cys Arg Cvs Arg Cys Asp Arg Gly Gin Met Thr Arg Asp Lys Gin Ala 
130 ' 135 140 

Leu Tyr Glu Lys Glu Lvs Ala Val Glu Lys Glu Lys Lys Lys Lys Ala 
145 150 155 160 

Lys Lys Ala Lvs Gin Leu Pro Gin Leu Gin Phe Asn Ser Lys Pro Leu 

165 170 175 

Ala Ala He Glu Glu Lys Lys Arg Gly Asp Ala Asp Val Phe His Ser 
180 185 190 

Pro Ser He Ala Ser Ser Thr Arg His His Thr Phe Ser Thr Thr Thr 
195 200 205 

Arg Ser Arg Leu His Thr Asp Arg Ser Ala Ser Ser He Leu Thr His 
210 215 220 

Thr He Gly Arg Thr Trp Ser Glu Ser Ser Phe Ala Gly Glu Thr Asn 
225 230 235 240 

Gly Gin Tyr Asp Asn Asn Gin Glu Pro His Pro Ser Asn Cys Glu Cys 

245 250 255 

Val Phe His His Asp Tyr Asp Ala Asp Asp Gin He Asp Thr Asp Phe 
260 265 270 

Glu Cys Glu Ser Asn His Ser Asp Val He Val Pro Ala Pro Leu Pro 
275 280 285 

Pro Leu Gin Ala Lys Ser Tyr Ala Ala Thr He Met Arg Asn Gly Thr 
290 295 300 

Pro Lys Val Thr Asn Tyr Ser Pro Asp Ser Gly Leu Asp Gin Gin Thr 
305 310 315 320 

Pro Arg Phe Ser Leu Ser Ser Ser Ser Gly Gly Asp Val Asp Asn Gin 

325 330 335 

His Gly Asp Phe His Val Glu Thr Arg He Ser Glu His Leu Asn Ala 
340 345 350 

Leu Gly Leu Ser He Met Ser Pro Val Glu Asn Ala Asn Glu Asn Val 
355 360 365 

Asn Tyr Glu Glu Ser Pro Phe Tyr Pro Glu Leu Thr Ser Thr Pro He 
370 375 380 

Val Ser Lys Lys Gin Arg Glu Pro Leu Arg Ala Lys Lys Ser Thr Ser 
385 390 395 400 

Val Ser Lys Leu Pro Leu Ala Pro Ser Ser Gin Leu Phe Asn Glu Glu 

405 410 415 
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Ser Arc Cys Gly Phe Arg Phe Asn Val Pro Val Arq Glu Met Met Asp 
420 " 425 430 

lie Trp Gin Glu Ser Gly Ala Leu Ser Pro Ala He Arg Glu Thr Gin 



435 



440 



445 



Ala Glu Asn Thr Glu Lys Arg Ala Glu Asn Ala Ser Gly Val Leu Gin 
4 50 4 55 4 60 

Tvr Glv Trp Thr Pro Phe Phe Gly Asn Gly Phe Asn Leu Gly Glu Arg 

465 ' 470 475 480 



Leu Tvr Tyr Phe Pro 
485 



<210> 39 
<211> 1056 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 39 

atgcaaaaca cacagatatt tactaacttc gctcacagag cacatgatgg attaccgttt 6U 
aacagtgcga atccttccaa caaagatcca attttcacaa tgccaatatc ggtcaaaccg 120 
aaaactcgtg agccggcttc tttcacagat aacaggatat atgtcagcaa tattcccttc 180 
tcgtttcgtg aacaagattt ggcggcaatg ttcttcgcat atggaagagt cctgagtgtg 240 
gaaatcgtca caaatgatcg tggatccaaa gggttcgggt ttgtcacact cgattccatc 300 
gaatcctgtg agaaagctcg tgctgcgctt cacgaatcac atgttcaagg aagaattata 360 
gaagtgagaa gagcgacacc aacccgcaga aagcttatca acaatccaca aaatgaagtt 420 
ttgccaccac caaagctgtg tgtcgatctt cgagcccctc ataatttatg gagagctgag 480 
ccaatgcatc agttgttcaa ggaaaaggag aacacaacat gttttcccga agctggattc 540 
atgatggcac cataccgtag caatggaatt ttcaacacgc gtagtcttgt gcagaccaaa 600 
ccacctcgat gcaccaagca cagcgagctc aagctttctt cagctggtga atacttctgc 660 
aaaaacggcg agcctacgac ggaaacaagt attctgatgt gcatgcacag acaaaactca 720 
ccatgcagca ataagtgttc tgattcttcg aatcacgagc tgtctgatgt ggagttgaac 780 
tctatattcc cacatcatct tcgtgaccag attactgctc ttctcgacac ttcaaaccat 
tttggatcag gaaataatag tgctaacaaa ggaaagagag caccatctgt gacatcttct 
ggattgagat catcagagag cgagacagtt tcagacgaag agattcattg gtccccacat 
aacagccctg attatcttct cgctgctctc tacgaaggtt ccacatcgtt ccacggaaag 
tctgtttctc caccaaaaga atcgtcaagc cagtaa 

<210> 40 
<211> 351 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 40 

Met Gin Asn Thr Gin He Phe Thr Asn Phe Ala His Arg Ala His Asp 



840 
900 
960 
1020 
1055 



1 



5 10 15 



Glv Leu Pro Phe Asn Ser Ala Asn Pro Ser Asn Lys Asp Pro He Phe 
20 25 30 

Thr Met Pro He Ser Val Lys Pro Lys Thr Arg Glu Pro Ala Ser Phe 
35 40 45 



Th 



r Asp Asn Arg He Tyr Val Ser Asn He Pro Phe Ser Phe Arg Glu 
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50 55 60 

Gin Asp Leu Ala Ala Met Phe Phe Ala Tyr Gly Arg Val Leu Ser Val 
65 70 75 80 

Giu He Val Thr Asn Asp Arg Gly Ser Lys Gly Phe Gly Phe Val Thr 

85 90 95 

Leu Asp Ser He Glu Ser Cys Glu Lys Ala Arg Ala Ala Leu His Glu 
100 105 110 

Ser His Val Gin Gly Arg He He Glu Val Arg Arg Ala Thr Pro Thr 
115 120 125 

Arg Arg Lys Leu He Asn Asn Pro Gin Asn Glu Val Leu Pro Pro Pro 
130 135 140 

Lys Leu Cys Val Asp Leu Arg Ala Pro His Asn Leu Trp Arg Ala Glu 
145 150 155 160 

Pro Met His Gin Leu Phe Lys Glu Lys Glu Asn Thr Thr Cys Phe Pro 
165 170 175 

Glu Ala Gly Phe Met Met Ala Pro Tyr Arg Ser Asn Gly He Phe Asn 
180 185 190 

Thr Arg Ser Leu Val Gin Thr Lys Pro Pro Arg Cys Thr Lys His Ser 
195 200 205 

Glu Leu Lys Leu Ser Ser Ala Gly Glu Tyr Phe Cys Lys Asn Gly Glu 
210 215 220 

Pro Thr Thr Glu Thr Ser He Leu Met Cys Met His Arg Gin Asn Ser 
225 230 235 240 

Pro Cys Ser Asn Lys Cys Ser Asp Ser Ser Asn His Glu Leu Ser Asp 
245 250 255 

Val Glu Leu Asn Ser He Phe Pro His His Leu Arg Asp Gin He Thr 
260 265 270 

Ala Leu Leu Asp Thr Ser Asn His Phe Gly Ser Gly Asn Asn Ser Ala 
275 280 285 

Asn Lys Gly Lys Arg Ala Pro Ser Val Thr Ser Ser Gly Leu Arg Ser 
290 295 300 

Ser Glu Ser Giu Thr Val Ser Asp Glu Glu He His Trp Ser Pro His 
305 310 315 320 

Asn Ser Pro Asp Tyr Leu Leu Ala Ala Leu Tyr Glu Gly Ser Thr Ser 
325 330 335 

Phe His Gly Lys Ser Val Ser Pro Pro Lys Glu Ser Ser Ser Gin 
340 345 350 



<210> 41 
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<211> 1053 

<213> Caenorhabditis elegans 
< 4 0 0 > 41 

caaaacacac agatatttac taacttcgct cacagagcac atgatggatt accgtttaac 60 

agngcgaatc cttccaacaa aganccaatt ttcacaangc caatatcggt caaaccgaaa 120 

acrcgtgagc cggcttcttr cacagauaac aggatatatg tcagcaatat tcccttctcg 180 

zutcgtgaac aagattrggc ggcaargttc rtcgcatatg gaagagrcct gagtgtggaa 240 

atcgtcacaa atgatcgtgg atccaaaggg ttcgggtttg tcacactcga ttccatcgaa 300 

tcctgtgaga aagctcgtgc tgcgcrtcac gaatcacatg ttcaaggaag aattatagaa 360 

gtgagaagag cgacaccaac ccgcagaaag cttatcaaca atccacaaaa tgaagttttg 420 

ccaccaccaa agctgtgtgt cgatcttcga gcccctcata atttatggag agctgagcca 480 

argcatcagt tgttcaagga aaaggagaac acaacatgtt ttcccgaagc tggattcatg 540 

atggcaccat accgtagcaa tggaattttc aacacgcgta gtcttgcgca gaccaaacca 600 

cctcgatgca ccaagcacag cgagctcaag ctttcttcag ctggtgaara cttctgcaaa 660 

aacggcgagc ctacgacgga aacaagtatt ctgatgtgca tgcacagaca aaactcacca 720 

tgcagcaata agtgttctga ttcttcgaat cacgagctgt ctgatgtgga gttgaactct 780 

atattcccac atcatcttcg tgaccagatt actgctcttc tcgacactrc aaaccatttt 840 

ggatcaggaa ataatagtgc taacaaagga aagagagcac catctgrgac atcttctgga 900 

ttgagatcai: cagagagcga gacagtttca gacgaagaga ttcattggtc cccacataac 960 

agccctgart atcttctcgc tgctctctac gaaggttcca catcgttcca cggaaagtct 1020 

gtttctccac caaaagaatc grcaagccag taa 10 53 

<210> 42 
<211> 350 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 42 

Gin Asn Thr Gin lie Phe Thr Asn Phe Ala His Arg Ala His Asp Gly 
15 10 15 

Leu Pro Phe Asn Ser Ala Asn Pro Ser Asn Lys Asp Pro lie Phe Thr 
20 25 30 

Met Pro He Ser Val Lys Pro Lys Thr Arg Glu Pro Ala Ser Phe Thr 
35 40 45 

Asp Asn Arg lie Tyr Val Ser Asn He Pro Phe Ser Phe Arg Glu Gin 
50 55 60 

Asp Leu Ala Ala Met Phe Phe Ala Tyr Gly Arg Val Leu Ser Val Glu 
65 70 75 80 

He Val Thr Asn Asp Arg Glv Ser Lys Gly Phe Gly Phe Val Thr Leu 

85 90 95 

Asp Ser He Glu Ser Cys Glu Lys Ala Arg Ala Ala Leu His Glu Ser 
100 105 110 

His Val Gin Gly Arg He He Glu Val Arg Arg Ala Thr Pro Thr Arg 
115 120 125 

Arg Lys Leu He Asn Asn Pro Gin Asn Glu Val Leu Pro Pro Pro Lys 
130 135 140 



Leu Cys Val Asp Leu Arg Ala Pro His Asn Leu Trp Arg Ala Glu Pro 
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145 150 155 160 

Met His Gin Leu Phe Lvs Glu Lvs Glu Asn Thr Thr Cys Phe Pro Glu 

165 ^ 170 175 

Ala Glv Phe Met Met Ala Pro Tyr Arg Ser Asn Gly He Phe Asn Thr 

13C 185 190 

Arg Ser Leu Val Gin Thr Lys Pro Pro Arg Cys Thr Lys His Ser Glu 



195 200 205 



Leu Lys Leu Ser Ser Ala Gly Glu Tyr Phe Cys Lys Asn Gly Glu Pro 
210 215 220 

Thr Thr Giu Thr Ser lie Leu Met Cys Met His Arg Gin Asn Ser Pro 
225 230 235 240 

Cys Ser Asn Lvs Cvs Ser Asp Ser Ser Asn His Glu Leu Ser Asp Val 

245 250 255 

Glu Leu Asn Ser He rr.^-r Pro His Hxs Leu Arg Asp Gin lie Thr Ala 
2£: 265 270 

Leu Leu Asp Tnr Ser Asn His Phe Gly Ser Gly Asn Asn Ser Ala Asn 
275 280 285 

Lys Gly Lys Ara Ala Pro Ser Val Thr Ser Ser Gly Leu Arg Ser Ser 
290 ' 295 300 

Glu Ser Glu Thr Val Ser Asp Glu Glu He His Trp Ser Pro His Asn 
305 310 315 320 

Ser Pro Asp Tvr Leu Leu Ala Ala Leu Tyr Glu Gly Ser Thr Ser Phe 

325 330 335 

His Gly Lys Ser Val Ser Pro Pro Lys Giu Ser Ser Ser Gin 
340 345 350 



<210> 43 
<211> 1349 
<212> DNA 

<213> Caenorhabdi tis eiegans 
<400> 43 

aagtttgagg taacaaaaat gccgccgcca gctccgtucg aggggcaaaa gaacaaaggg 60 

cacaatgtcc acaaaactgg aggaaaagca anaaagcgaa atgagaagga gccgagagtc 120 

aagggaaaca atctgaaagc tttcactttc cactccgccg tatccgctgg aaaagcgatt 180 

cgacgagctg cagarctcaa tgaaaagaag aaacatgtrc tgatgargga cagaaaaccc 240 

atcgaaacac caccaatcat tgtagcaatc gttggaccga gtaaagtcgg aaaaacgaca 300 

cttctccggg gtctcgtcaa gtattacctc cgtgarggat tcggagagat caatggtcca 360 

gtgacaattg taactggaaa gaaacgtcgt gtacagttca ttgaggtcaa aaacgatatt 420 

aatcatatga ttgatatcgc gaaagtcgca gatttggtgc ttctaatggt cgatgcatcg 480 

tatggatttg aaatggaaac ctttgaattt ctaaatattt gccaagtgca cggaatgcca 540 

cgtattatgg gagtattgaa tcatttggat cttctcgatg gaatctcacg tgtcaataag 600 

accaagaaaa ttctgaaaca tcgtttctgg acggagctct accagggcgc gaagcttttc 660 

tacatgactg gaatgatgca tggacagtat aaatataatg agatccataa cctctgcaga 720 

ttcatttctg t-catgaaatt ccgtccgatg gtgtggaaag atgctcatcc atacgttctt 780 
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tgtgs-cgtt tcgaagacat taccaacgtc gaaactcttc gaacggatcc actcatcgat 840 

cgacacactg ccatgtatgg atgggttr.at ggtgctcatt tgaagaatca ttcgtcgatt 900 

catgtgccag gtgttggtga tatgaggatc agtaatgtca cgagtctacc cganccgtgt 950 

ccgctgcctg atgagattaa gaaacgagcg ttgaatgaga aagagcggaa agtgtatgct 1020 

ccgttrtctg gattaggagg tgtcatttat gataaggatg cgatttatat tgagtcaaag i080 

aatgczcaca attttaatag aaaacgcgac ggactcgttg aagctctcga aggcgtcaag 1140 

tcaggaaccg atgataaatt gaagaaatcc tctccgcaac ttctcggtga ttcagtagca 1200 

cttga^attg atcaggaaag tgattggcca gagcctggag aagaagatga agaagacctg 1260 

gatgaggagg attttcagga tgaagaagaa gatgaagatg aggatgagga tgaggaagat 1320 
gttggrgtcg tgaaaaagga aggtgract 1349 

<210> 44 

<211> 449 

<212> PRT 

<213> Caenorhabditis elegans 

<400> 44 

Lvs Phe Glu Val Thr Lys Met Pro Pro Pro Ala Pro Phe Glu Gly Gin 
'l 5 10 15 

Lys As-i Lys Gly His Asn Val His Lys Thr Gly Gly Lys Ala Xaa Lys 
20 25 30 

Arg Asn Glu Lys Glu Pro Arg Val Lys Gly Asn Asn Leu Lys Ala Phe 
35 40 45 

Thr Phe His Ser Ala Val Ser Ala Gly Lys Ala lie Arg Arg Ala Ala 
50 55 60 

Asp Leu Asn Glu Lys Lys Lys His Val Leu Met Met Asp Arg Lys Pro 
65 70 -75 80 

lie Glu Thr Pro Pro lie He Val Ala He Val Gly Pro Ser Lys Val 

85 90 95 

Gly Lvs Thr Thr Leu Leu Arg Glv Leu Val Lys Tyr Tyr Leu Arg Asp 
100 105 110 

Gly P-.e Gly Glu He Asn Gly Pro Val Thr He Val Thr Gly Lys Lys 
115 120 125 

Arg Arg Val Gin Phe He Glu Val Lys Asn Asp He Asn His Met He 
130 135 140 

Asp He Ala Lys Val Ala Asp Leu Val Leu Leu Met Val Asp Ala Ser 
145 150 155 160 

Tyr Gly Phe Glu Met Glu Thr Phe Glu Phe Leu Asn He Cys Gin Vai 

165 170 175 

His Gly Met Pro Arg He Met Gly Val Leu Asn His Leu Asp Leu Leu 
180 185 190 

Asp Gly He Ser Arg Val Asn Lys Thr Lys Lys He Leu Lys His Arg 
195 200 205 

Phe T-p Thr Glu Leu Tyr Gin Gly Ala Lys Leu Phe Tyr Met Thr Gly 
210 215 220 
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Met Mer His Gly Gin Tyr Lys Tyr Asn Glu lie His Asn Leu Cys Arg 
225 230 235 240 

?he lie Ser Val Met Lys Phe Arg Pro Met Val Trp Lys Asp Ala His 

245 250 255 

Fro ivr Val Leu Cys Asp Arg Phe Glu Asp lie Thr Asn Val Glu Thr 
260 265 ' 270 

^ Arr: Thr Asp Pro Leu lie Asp Arg His lie Ala Met Tyr Gly Trp 
275 280 285 

Gly Ala His Leu Lys Asn His Ser Ser lie His Val Pro Gly 

2 95 300 

g:-. Asp Met Arg lie Ser Asn Val Thr Ser Leu Pro Asp Pro Cys 
J : ' 310 315 320 

::: lzz> Asp Glu lie Lys Lys Arg Ala Leu Asn Glu Lys Glu Arg 

325 330 335 

./.•5 Val Tvr Ala Pro Phe Ser Gly Leu Gly Gly Val lie Tyr Asp Lys 
340 345 350 

Asp Ala lie Tyr lie Glu Ser Lys Asn Ala His Asn Phe Asn Arg Lys 
355 ^ 360 365 

Arg Asp Gly Leu Val Glu Ala Leu Glu Gly Val Lys Ser Gly Thr Asp 
370 ^ 375 380 

Asp Lys Leu Lys Lys Ser Ser Leu Gin Leu Leu Gly Asp Ser Val Ala 
365 ^ 390 395 400 

Leu Asp lie Asp Gin Glu Ser Asp Trp Pro Glu Pro Gly Glu Glu Asp 

405 ' 410 415 

Glu Glu Asp Leu Asp Glu Glu Asp Phe Gin Asp Glu Glu Glu Asp Glu 
420 425 430 

Asp Glu Asp Glu Asp Glu Glu Asp Val Gly Val Val Lys Lys Glu Gly 
435 ' 440 445 

Val 



<210> 45 
<211> 3423 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 45 

atgtcgtacc gcgtgttttc acgcgtcgat ccccatgtgc catcgacatc cacagctcca 60 
cagcgtcgtt ttcaagaaaa cttgcaaaaa ttcaagcgat tttttccgcc aaacgcacca 120 
ccaacactct gggaagtttg ctcctcgaaa caacgcagca aatcattgaa aaacacgttt 180 
caaacggaag tacgtgcact acgaggactt aattttacag tattgctgaa tccgtacaaa 240 
aactatctca atgatctcac aaatctatcc ggtttcacct tcgatgatct ttgtcaagca 300 
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cttcgattct ttgcatttra tagaaaacag 
.^^r.gaattat ttcgattaar tgcaagttqc 
tccatcgata aatcaacact agtggatacg 
cttaagaatc tgaaacaaga atcatctgaa 
caagatgatg agctaccttc tgttccaata 
gtaccaattc ctccgcctcc ttgtagaaat 
gaagatgaac atttacgatc aatgcacctg 
trcaatgggc cattrgagag aactatcaaa 
tccaaagatc gacgattgag cgargtggaa 
acacatatca gttttagatt agattctgtg 
attgaagatg atggatattt ctccaaagaa 
aattatcaaa ctgctcttca tgaatgcaca 
ggacgtacaa tcaagccggg attcgttgaa 
tttcaratga tgggacgaca tatrcatgga 
ggacggagag aarcaaatgg aaaatgggtc 
catattatgt ttccattatt cgtaaaatgg 
tttgataagt acagcttgta tggattaaga 
gaattgttga atgcrggaga gaacttattt 
aatagaratg ataiigagcac tacgaaagga 
gagaataatc gagttgtgga caagcatatg 
gaggacggaa gaaggagaaa gaggaaaccg 
aaagtacgaa ca cc tcgaaa agr tcttgag 
tcttcatcaa ctccaaaa ca acgcgcccrg 
atgattaatc aagtgccacc agttgraaat 
caactcaaaa agacgccaac ttcggctaca 
ttgaagtcgg aatctgtccc agctccacca 
agttgggatg atcatgttgg taattcgatg 
gagaaaatgg aagattcaga aggaagacag 
gaatatgcac ctgaacaata tgcgcgagat 
gatcatccac ctcaaccaca acaaccacca 
agacggaaat ccggtcagaa aactgatcaa 
tccgatcctg caccgcctgt rcctgctgct 
atrgttccag ttcatcctgt acctttgcca 
acgattgcaa agacaacagc acgagtactg 
gcgaacaaaa ctgtgtatcc tgtcaagaag 
attttaaatg gaaatactgc aactgcaagt 
agaaatgcat atacattttc artacagcaa 
ccgaaaccaa ctcaaaatgg aartgaaaca 
gcagccttct atagtgagag tcatcctaca 
cctccagttg ccacatcatc tacttttgta 
cgagcaatag ctcatcagaa gcaaatgctc 
ccgtttaata ttcaaatggc attcaaacca 
ggtgttttgg ctcattcaaa ttcaacaata 
acacaatcag actttcggcc gagaagcttc 
gctccrgttc caaatgcaat aaagagaaga 
gtgccatgga tcgcaaatag cagggcgttg 
acggctgaaa agatgtcttc gagcccgtta 
atgccaactc ctcctcctcc ggctccggct 
gcaactccag cattggtgga tacaatatcg 
ggcgattcaa atcagtcaaa tcctccttta 
cctggaacaa cactgacacc tgaagaaatc 
tttcaagagg atggtgatga agggccaaca 
tga 



67 

ccagttttga agtcaaatat ggaagatgct 360 
atcatttatt caaatgataa ctggagggcg 420 
ctgtcaarga acattttgga gaagcagagg 48U 
caaaaagatc caatataccc accactattc 540 
caaattggca gattgaaaga cagagaaaaa 600 
gatttttcga tgcgacaatt taatccgtrg 660 
tggaatcatg ttgggtgtaa tgatgcaaaa 720 
atgatgtcta agaataatgt tgcgattcgc 780 
tattacggag acaatgaaga tttaccatcc 840 
atgcaattga ttaactttga tttccccaaa 900 
tgtctggata gtgcatggta nctctacgag 960 
actgcrttcg cagttatccg acctccatct 1020 
gatggcctaa ctacagacga atgttcagaa 1080 
tttttccaag tatggaggga agaggatcgt 1140 
ccacggcgat atttggtgga tatctacaat 1200 
gaactttggc cgtcaacatt gaaatgggct 1260 
ctaatgtcaa tgattagaag acatccacaa 1320 
tcgaggtatc catcgcattt gttggaatcg 1380 
cggaatcagt atttatcagc aattcaaatg 1440 
cacagttcgg cttacaagct tcttatcgaa 1500 
aaggatgaag ctctactcgg agttgctgca 1560 
ccacctctat ttgctccaac tcgatttatt 1620 
cttgttcaaa aagagaattt ggagaaaaca 1680 
acaccaccat ccccacaaca gactgcttca 1740 
aaacgtcatt taccggaaat tgagcaagaa 1800 
ccaaccaaga aaatgtcaat tatagctgac 1860 
gaagaagaac atgtagatga gaaggattcg 1920 
aatgtttggg ttccacagga tcgaggaaaa 1980 
attatcgaac attatattcc tgctgctcga 2040 
cctccactac caacgccgaa gcctccacga 2100 
acgactccat catcagacgc cgaagcttca 2160 
ccagtggctc ctgttgttcc aattgttcca 2220 
aacggaagtg taaatactcc aaaagtgaag 2280 
tattccatta aacctcaaat accaccaatt 2340 
ttgacacctt ctgtagttcc gtctccaatg 2400 
ccatcgaaaa atgcagcatc tgtagttgtc 2460 
aaagctccat attatccagc tggaatgcgt 2520 
cctccgacag gagcacaatc tttaatgcgt 2580 
cgaagtcccc tcgtcccgta tggttttgtt 2640 
cctgctgcaa caataccatc acctgctagc 2700 
ttaaatacgg aaacgtgtcg acgcgttatg 2760 
cgtcgttggg atcctcttcc gaaatcttct 2820 
ccatatgttc agcgagtccc caacaactca 2880 
agtcaaaata gtgttgcttc cccggctcct 2940 
gaagtaggga atttaaaatc tcggcagtat 3O00 
gtggcagcgg cgatggcaac gatggaggag 3060 
ctatcgtcac aggctccgat gacaactttg 3120 
cctgctcaag catctgcaca atcaacatca 3180 
gctggttcaa cgactacaga aacaactaca 3240 
cggacgtaca catcacacat tcgaaaaact 3300 
ggcgacgcga ttcgaactga aagtcaaaga 3360 
gtgaaaagct tcctaatgaa catctacaaa 3420 

3423 



<210> 46 
<211> 1140 
<212> PRT 

<213> Caenorhabditis elegans 
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<400> 46 

Met Ser Tyr Arg Val • Phe Ser Arg Val Asp Pro His Val Pro Ser Thr 
15 10 15 

Ser Thr Ala Pro Gin Arg Arg Phe Gin Glu Asn Leu Gin Lys Phe Lys 
20 25 30 

Arg Phe Phe Pro Pro Asn Ala Pro Pro Thr Leu Trp Glu Val Cys Ser 
35 40 45 

Ser Lys Gin Arg Ser Lys Ser Leu Lys Asn Thr Phe Gin Thr Glu Val 
50 55 60 

Arg Ala Leu Arg Gly Leu Asn Phe Thr Val Leu Leu Asn Pro Tyr Lys 
6 5 70 75 80 

Asn Tyr Leu Asn Aso Leu Thr Asn Leu Ser Gly Phe Thr Phe Asp Asp 

85 90 95 

Leu Cys Gin Ala Leu Arg Phe Phe Ala Phe Tyr Arg Lys Gin Pro Val 
100 105 110 

Leu Lys Ser Asn Met Glu Asp Ala Asn Glu Leu Phe Arg Leu lie Ala 
115 120 125 

Ser Cys lie lie Tyr Ser Asn Asp Asn Trp Arg Ala Ser lie Asp Lys 
130 135 140 

Ser Thr Leu Val Asd Thr Leu Ser Met Asn lie Leu Glu Lys Gin Arg 
145 ^ 150 155 160 

Leu Lys Asn Leu Lvs Gin Glu Ser Ser Glu Gin Lys Asp Pro lie Tyr 

165 170 175 

Pro Pro Leu Phe Gin Asp Asp Glu Leu Pro Ser Val Pro lie Gin lie 
180 185 190 

Giy Arg Leu Lys Asp Arg Glu Lys Val Pro lie Pro Pro Pro Pro Cys 
195 ^ 200 205 

Arg Asn Asp Phe Ser Met Arg Gin Phe Asn Pro Leu Glu Asp Glu His 
210 215 220 

Leu Arg Ser Met His Leu Trp Asn His Val Giy Cys Asn Asp Ala Lys 
225 230 ^ 235 240 

Phe Asn Gly Pro Phe Glu Arg Thr 'lie Lys Met Met Ser Lys Asn Asn 
245 250 255 

Vai Ala lie Arg Ser Lys Asp Arg Arg Leu Ser Asp Val Glu Tyr Tyr 
260 265 270 

Giy Asp Asn Glu Asp Leu Pro Ser Thr His lie Ser Phe Arg Leu Asp 
275 280 285 

Ser Vai Met Gin Leu lie Asn Phe Asp Phe Pro Lys lie Glu Asp Asp 
290 295 300 
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Glv Tvr Phe Ser Lys Glu Cys Leu Asp Ser Ala Trp Tyr Leu Tyr Glu 
305 " " 310 315 320 

Asn Tyr Gin Thr Ala Leu His Glu Cys Thr Thr Ala Phe Ala Val He 

325 330 335 

Fro Pro Ser Glv Arg Thr He Lys Pro Gly Phe Val Glu Asp Gly 
340 ^ 345 350 

1- - T.'.r Tnr Asp Glu Cys Ser Glu Phe His Met Met Gly Arg His He 
355 ^ 360 365 

Z' - Pne Phe Gin Val Trp Arg Glu Glu Asp Arg Gly Trp Arg Glu 
y': 375 380 

. • "":v Lys Trp Val Pro Arg Arg Tyr Leu Val Asp He Tyr Asn 

390 395 400 

-.-c v..r Phe Pro Leu Phe Val Lys Trp Glu Leu Trp Pro Ser Thr 

405 410 415 

. Lv^ Tro Aia Phe Asd Lys Tyr Ser Leu Tyr Gly Leu Arg Leu Met 
420 ' ^ 425 430 

Sor Met He Arg Arg His Pro Gin Glu Leu Leu Asn Ala Gly Glu Asn 
^"'435 440 445 

Leu Phe Ser Arg Tyr Pro Ser His Leu Leu Glu Ser Asn Arg Tyr Asp 
450 455 460 

r-^'O- Ser Thr Lys Gly Arg Asn Gin Tvr Leu Ser Ala He Gin Met 

465 470 475 480 

Glu Asn Asn Arg Val Val Asd Lys His Met His Ser Ser Ala Tyr Lys 

485 490 495 

Leu Leu He Glu Glu Asp Glv Arg Arg Arg Lys Arg Lys Pro Lys Asp 
500 505 510 

Glu Ala Leu Leu Gly Val Ala Ala Lys Val Arg Thr Pro Arg Lys Val 
515 520 525 

Leu Glu Pro Pro Leu Phe Ala Pro Thr Arg Phe He Ser Ser Ser Thr 
530 535 540 

Pro Lvs Gin Arg Ala Leu Leu Val Gin Lys Glu Asn Leu Glu Lys Thr 
545 ' 550 555 560 

Ti^ Asp Gin Val Pro Pro Val Val Asn Thr Pro Pro Ser Pro Gin 
565 570 575 

Gin Trr Ala Ser Gin Leu Lys Lys Thr Pro Thr Ser Ala Thr Lys Arg 
580 585 590 

His Leu Pro Glu He Glu Gin Glu Leu Lys Ser Glu Ser Val Pro Ala 
595 600 605 
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Pro Pro Pro Thr Lvs Lvs Met Ser lie He Ala Asp Ser Trp Asp Asp 
610 ' 615 620 

His Val Gly Asn Ser Met Glu Glu Glu His Val Asp Glu Lys Asp Ser 
625 630 635 640 

Glu Lvs Met Glu Asp Ser Glu Gly Arg Gin Asn Val Trp Val Pro Gin 

645 650 655 

Asp Arg Gly Lys Glu Tyr Ala Pro Glu Gin Tyr Ala Arg Asp He He 
660 665 670 

Glu His Tyr He Pro Ala Ala Arg Asp His Pro Pro Gin Pro Gin Gin 
675 680 685 

Pro Pro Pro Pro Leu Pro Thr Pro Lys Pro Pro Arg Arg Arg Lys Ser 
690 695 700 

Gly Gin Lys Thr Asp Gin Thr Thr Pro Ser Ser Asp Ala Glu Ala Ser 
705 710 715 720 

Ser ASD Pro Ala Pro Pro Val Pro Ala Ala Pro Val Ala Pro Val Val 

725 730 735 

Pro He Val Pro He Val Pro Val His Pro Val Pro Leu Pro Asn Gly 
740 745 750 

Ser Val Asn Thr Pro Lys Val Lys Thr He Ala Lys Thr Thr Ala Arg 
755 760 765 

Val Leu Tyr Ser He Lys Pro Gin He Pro Pro He Ala Asn Lys Thr 
770 775 780 

Val Tyr Pro Val Lys Lys Leu Thr Pro Ser Val Val Pro Ser Pro Met 
785 790 795 800 

He Leu Asn Gly Asn Thr Ala Thr Ala Ser Pro Ser Lys Asn Ala Ala 

805 810 815 

Ser Val Val Val Arg Asn Ala Tyr Thr Phe Ser Leu Gin Gin Lys Ala 
820 825 830 

Pro Tyr Tyr Pro Ala Gly Met Arg Pro Lys Pro Thr Gin Asn Gly He 
835 840 845 

Glu Thr Pro Pro Thr Gly Ala Gin Ser Leu Met Arg Ala Ala Phe Tyr 
850 855 860 

Ser Glu Ser His Pro Thr Arg Ser Pro Leu Val Pro Tyr Gly Phe Val 
865 870 875 880 

Pro Pro Val Ala Thr Ser Ser Thr Phe Val Pro Ala Ala Thr He Pro 

885 890 895 

Ser Pro Ala Ser Arg Ala He Ala His Gin Lys Gin Met Leu Leu Asn 
900 905 910 

Thr Glu Thr Cys Arg Arg Val Met Pro Phe Asn He Gin Met Ala Phe 
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915 920 925 

Lys Pro Arg hrg Trp Asp Pro Leu Pro Lys Ser Ser Gly Val Leu Ala 
930 935 940 

His Ser Asn Ser Thr lie Pro Tyr Val Gin Arg Val Pro Asn Asn Ser 
945 950 955 960 

Trr Gir Ser Asp Phe Arc Pro Arg Ser Phe Ser Gin Asn Ser Val Ala 

965 970 975 

Ser Pro Ala Pro Ala Pro Val Pro Asn Ala . He Lys Arg Arg Glu Val 
980 985 990 

Glv Asn Leu Lys Ser Arg Gin Tyr Val Pro Trp He Ala Asn Ser Arg 
995 1000 1005 

Ala Leu Val Ala Ala Ala Met Ala Thr Met Glu Glu Thr Ala Glu Lys 
1010 1015 1020 

Met '^er S^r S^^- Pro Leu Leu Ser Ser Gin Ala Pro Met Thr Thr Leu 
{025^ 1030 1035 1040 

Met Pro Thr Pro Pro Pro Pro Ala Pro Ala Pro Ala Gin Ala Ser Ala 
1045 1050 1055 

Gin Ser Thr Ser Ala Thr Pro Ala Leu Val Asp Thr He Ser Ala Gly 
1060 1065 1070 

Ser Thr Thr Thr Glu Thr Thr Thr Gly Asp Ser Asn Gin Ser Asn Pro 
1075 1080 1085 

Pro Leu Arg Thr Tyr Thr Ser His He Arg Lys Thr Pro Gly Thr Thr 
1090 1095 1100 

Leu Thr Pro Glu Glu He Gly Asp Ala He Arg Thr Glu Ser Gin Arg 
1105 1110 1115 1120 

Phe Gin Glu Asp Gly Asp Glu Gly Pro Thr Val Lys Ser Phe Leu Met 
1125 1130 1135 



Asn He Tyr Lys 
1140 



<210> 47 
<211> 1644 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 47 

ttaccggaaa rtgagcaaga attgaagtcg 

aaaatgtcaa ttatagctga cagttgggat 

catgtagatg agaaggattc ggagaaaatg 

gttccacagg atcgaggaaa agaatatgca 

cattatattc ctgctgctcg agatcatcca 

ccaacgccga agcctccacg aagacggaaa 

tcatcagacg ccgaagcttc atccgatcct 



gaatctgtcc cagctccacc accaaccaag 60 

gatcatgttg gtaattcgat ggaagaagaa 120 

gaagattcag aaggaagaca gaatgtttgg 180 

cctgaacaat atgcgcgaga tattatcgaa 240 

cctcaaccac aacaaccacc acctccacta 300 

tccggtcaga aaactgatca aacgactcca 360 

gcaccgcctg ttcctgctgc tccagtggct 420 
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cctgttgttc 
gtaaatactc 
aaacctcaaa 
tctgtagttc 
aatgcagcat 
tattatccag 
ggagcacaa t 
czcgtcccgn 
acaaracca:: 
gaaacgtgtc 
gatcctcttc 
cagcgagtcc 
agtgttgctt 
aatttaaaat 
gcgat-ggcaa 
caggctccga 
gcatctgcac 
acgactacag 
acatcacaca 
attcgaactg 
ttcctaatga 



caattgttcc 
caaaagtgaa 
taccaccaat 
cgtctccaat 
ctgtagttgt 
ctggaatgcg 
ctrtaatgcg 
atggttttgu 
caccrgctag 
gacgcgttat 
cgaaatcttc 
ccaacaactc 
ccccggctcc 
ctcggcagta 
cgatggagga 
tgacaacttt 
aatcaacatc 
aaacaactac 
ttcgaaaaac 
aaagtcaaag 
acatctacaa 



aattgttcca 
gacgattgca 
tgcgaacaaa 
gattttaaat 
cagaaatgca 
tccgaaacca 
tgcagccctc 
tcctccagrt 
ccgagcaata 
gccgtttaac 
tggtgttttg 
aacacaatca 
tgctcctgtt 
tgtgccatgg 
gacggctgaa 
gatgccaact 
agcaactcca 
aggcgattca 
tcctggaaca 
atttcaagag 
azga 



12 

gttcatcctg 
aagacaacag 
actgtgtatc 
ggaaatactg 
tatacatttr 
actcaaaatg 
tatagtgaga 
gccacatcat 
gctcatcaga 
attcaaatgg 
gctcattcaa 
gactttcggc 
ccaaatgcaa 
atcgcaaata 
aagatgtctt 
cctcctcctc 
gcattggtgg 
aatcagtcaa 
acactgacac 
gatggtgatg 



tacctttgcc 
cacgagtact 
ctgtcaagaa 
caactgcaag 
cattacagca 
gaattgaaac 
gtcat cctac 
ctacttttgt 
agcaaargct 
cattcaaacc 
at t caacaat 
cgagaagctt 
taaagagaag 
gcagggcgtt 
cgagcccgtt 
cggctccggc 
atacaatatc 
atcctccttt 
ctgaagaaat 
aagggccaac 



aaacggaagt 480 
gtattccatt 540 
gttgacacct 600 
tccatcgaaa 660 
aaaagctcca 720 
acctccgaca 780 
acgaagtccc 840 
acctgctgca 900 
cttaaaracg 960 
acgtcgttgg 1020 
accatatgtt 1080 
cagtcaaaat 1140 
agaagtaggg 12 00 
ggtggcagcg 12 60 
acta teg tea 1320 
tcctgctcaa 1380 
ggctggttca 1440 
acggacgtac 1500 
cggcgacgcg 1560 
agtgaaaagc 1620 
1644 



<210> 48 
<211> 547 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 48 

Leu Pro Glu lie Glu Gin Glu Leu Lys Ser Glu Ser Val Pro Ala Pro 
15 10 15 

Pro Pro Thr Lys Lys Met Ser He He Ala Asp Ser Trp Asp Asp His 
20 25 30 

Val Gly Asn Ser Met Glu Glu Glu His Val Asp Glu Lys Asp Ser Glu 
35 40 45 

Lys Met Glu Asp Ser Glu Gly Arg Gin Asn Val Trp Val Pro Gin Asp 
50 55 60 

Arg Gly Lys Glu Tyr Ala Pro Glu Gin Tyr Ala Arg Asp He He Glu 
65 70 75 80 

His Tvr He Pro Ala Ala Arg Asp His Pro Pro Gin Pro Gin Gin Pro 

85 90 95 

Pro Pro Pro Leu Pro Thr Pro Lys Pro Pro Arg Arg Arg Lys Ser Gly 
100 105 110 

Gin Lys Thr Asp Gin Thr Thr Pro Ser Ser Asp Ala Glu Ala Ser Ser 
115 120 125 

Asp Pro Ala Pro Pro Val Pro Ala Ala Pro Val Ala Pro Val Val Pro 
130 135 140 

He Val Pro He Val Pro Val His Pro Val Pro Leu Pro Asn Gly Ser 
145 150- 155 160 
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Val Asr Thr Pro Lys Val Lys Thr lie Ala Lys Thr Thr Ala Arg Val 

165 1*70 175 

Leu Tyr Ser lie Lys Pro Gin He Pro Pro He Ala Asn Lys Thr Val 
180 185 190 

Tyr Pro Val Lys Lys Leu Thr Pro Ser Val Val Pro Ser Pro Met He 
195 200 205 

Leu Asn Gly Asn Thr Ala Thr Ala Ser Pro Ser Lys Asn Ala Ala Ser 
210 215 220 

Val Val Val Arg Asn Ala Tyr Thr Phe Ser Leu Gin Gin Lys Ala Pro 
225 230 235 240 

Tyr Tyr Pro Ala Gly Met Arg Pro Lys Pro Thr Gin Asn Gly lie Glu 

245 250 255 

Thr Pro Pro Thr Gly Ala Gin Ser Leu Met Arg Ala Ala Phe Tyr Ser 
260 265 270 

Glu Ser His Pro Thr Arg Ser Pro Leu Val Pro Tyr Gly Phe Val Pro 
275 280 285 

Pro Val Ala Thr Ser Ser Thr Phe Val Pro Ala Ala Thr He Pro Ser 
290 295 300 

Pro Ala Ser Arg Ala He Ala His Gin Lys Gin Met Leu Leu Asn Thr 
305 310 315 320 

Glu Thr Cys Arg Arg Val Met Pro Phe Asn He Gin Met Ala Phe Lys 

325 330 ' 335 

Pro Arg Arg Trp Asp Pro Leu Pro Lys Ser Ser Gly Val Leu Ala His 
340 345 350 

Ser Asn Ser Thr He Pro Tyr Val Gin Arg Val Pro Asn Asn Ser Thr 
355 360 365 

Gin Ser Asp Phe Arg Pro Arg Ser Phe Ser Gin Asn Ser Val Ala Ser 
370 375 380 

Pro Ala Pro Ala Pro Val Pro Asn Ala He Lys Arg Arg Glu Val Gly 
385 390 395 400 

Asn Leu Lys Ser Arg Gin Tyr Val Pro Trp He Ala Asn Ser Arg Ala 

405 410 415 

Leu Val Ala Ala Ala Met Ala Thr Met Glu Glu Thr Ala Glu Lys Met 
420 425 430 

Ser Ser Ser Pro Leu Leu Ser Ser Gin Ala Pro Met Thr Thr Leu Met 
435 440 445 

Pro Thr Pro Pro Pro Pro Ala Pro Ala Pro Ala Gin Ala Ser Ala Gin 
450 455 460 

Ser Thr Ser Ala Thr Pro Ala Leu Val Asp Thr He Ser Ala Gly Ser 
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465 



470 



475 



480 



Thr Thr Thr Glu Thr Thr Thr Gly Asp Ser Asn Gin Ser Asn Pro Pro 

485 490 495 

Leu Arg Thr Tyr Thr Ser His lie Arg Lys Thr Pro Gly Thr Thr Leu 

500 505 510 

Thr Pro Glu Glu lie Gly Asp Ala He Arg Thr Glu Ser Gin Arg Phe 

515 ^ ' 520 525 

Gin Glu Asp Gly Asp Glu Gly Pro Thr Val Lys Ser Phe Leu Met Asn 

530 535 540 



He Tyr Lys 
545 



<210> 49 
<211> 1248 
<212> DNA 

<213> Homo sapiens 
<400> 49 

atgcagatgt cctacgccat ccggtgcgcc ttctaccagc tgctgctggc cgcgcrcatg 60 

ctggtggcga tgctgcagct gctctacctg tcgctgctgt ccggactgca cgggcaggag 120 

gagcaagacc aatattttga gttctttccc ccgtccccac ggtccgtgga ccaggtcaag 180 

gcgcagctcc gcaccgcgct ggcctctgga ggcgtcctgg acgctagcgg cgatcaccgc 240 

gtctacaggg gcctgctgaa gaccaccatg gaccccaacg atgtgatcct ggccacgcac 300 

gccagcgtgg acaacctgct gcacctgtcg ggtctgctgg agcgctggga gggcccgctg 360 

tccgtgtcgg tgttcgcggc caccaaggag gaggcgcagc tggccacggt gctggcctac 420 

gcgctgagca gccactgccc cgacatgcgc gccagggtcg ccatgcacct cgtgtgcccc 480 

tcgcgttacg aggcagccgt gcccgacccc cgggagccgg gggagtttgc cctgctgcgg 540 

tcctgccagg aggtctttga caagctagcc agggtggccc agcccgggat taattatgcg 600 

ctgggcacca atgtctccta ccccaataac ctgctgagga atctggctcg tgagggggcc 660 

aactatgccc tggtgatcga tgtggacatg gtgcccagcg aggggctgtg gagaggcctg 720 

cgggaaatgc tggatcagag caaccagtgg ggaggcaccg cgctggtggt gccrgccttc 780 

gaaatccgaa gagcccgccg catgcccatg aacaaaaacg agctggtgca gctctaccag 840 

gttggcgagg tgcggccctt ctattatggg ttgtgcaccc cctgccaggc acccaccaac 900 

tattcccgct gggtcaacct gccggaagag agcttgctgc ggcccgccta cgtggtacct 960 

tggcaggacc cctgggagcc attctacgtg gcaggaggca aggtgcccac cttcgacgag 1020 

cgctttcggc agtacggctt caaccgaatc agccaggcct gcgagctgca tgtggcgggg 1080 

tttgattttg aggtcctgaa cgaaggtttc ttggttcata agggcttcaa agaagcgttg 1140 

aagttccatc cccaaaagga ggctgaaaat cagcacaata agatcctata tcgccagttc 1200 

aaacaggagt tgaaggccaa gtaccccaac tctccccgac gctgctga 1248 

<210> 50 
<211> 415 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Met Gin Met Ser Tyr Ala He Arg Cys Ala Phe Tyr Gin Leu Leu Leu 
15 10 15 



Ala Ala Leu Met Leu Val Ala Met Leu Gin Leu Leu Tyr Leu Ser Leu 
20 25 30 
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Leu Ser Gly Leu His Gly Gin Glu Glu Gin Asp Gin Tyr Phe Glu Phe 
35 40 4 5 

Phe Pro Pro Ser Pro Arg Ser Val Asp Gin Val Lys Ala Gin Leu Arg 
50 55 60 

■-hr Ala Leu Ala Ser Gly Gly Val Leu Asp Ala Ser Gly Asp Tyr Arg 

'el ' 70 75 80 

Val Tyr Arg Gly Leu Leu Lys Thr Thr Met Asp Pro Asn Asp Val He 

85 90 95 

Leu Ala Thr His Ala Ser Val Asp Asn Leu Leu His Leu Ser Gly Leu 
100 105 110 

Leu Glu Arg Trp Glu Gly Pro Leu Ser Val Ser Val Phe Ala Ala Thr 
115 120 125 

Lys Glu Glu Ala Gin Leu Ala Thr Val Leu Ala Tyr Ala Leu Ser Ser 
130 135 140 

His Cvs Pro Asp Met Arg Ala Arg Val Ala Met His Leu Val Cys Pro 
145 ' 150 155 160 

Ser Arq Tyr Glu Ala Ala Val Pro Asp Pro Arg Glu Pro Gly Glu Phe 

165 110 l^S 

Ala Leu Leu Arg Ser Cys Gin Glu Val Phe Asp Lys Leu Ala Arg Val 
180 185 190 

Ala Gin Pro Gly He Asn Tyr Ala Leu Gly Thr Asn Val Ser Tyr Pro 
195 200 205 

Asn Asn Leu Leu Arg Asn Leu Ala Arg Glu Gly Ala Asn Tyr Ala Leu 
210 215 220 

Val He Asp Val Asp Met Val Pro Ser Glu Gly Leu Trp Arg Gly Leu 
225 230 235 240 

Arq Glu Met Leu Asp Gin Ser Asn Gin Trp Gly Gly Thr Ala Leu Val 

245 250 255 

Val Pro Ala Phe Glu He Arg Arg Ala Arg Arg Met Pro Met Asn Lys 
260 265 270 

Asn Glu Leu Val Gin Leu Tyr Gin Val Gly Glu Val Arg Pro Phe Tyr 
275 280 285 

Tyr Gly Leu Cys Thr Pro Cys Gin Ala Pro Thr Asn Tyr Ser Arg Trp 
290 295 300 

Val Asn Leu Pro Glu Glu Ser Leu Leu Arg Pro Ala Tyr Val Val Pro 
305 310 315 320 

Trp Gin Asp Pro Trp Glu Pro Phe Tyr Val Ala Gly Gly Lys Val Pro 

325 330 335 

Thr Phe Asp Glu Arg Phe Arg Gin Tyr Gly Phe Asn Arg He Ser Gin 
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340 345 350 

Ala Cys Glu Leu His Val Ala Gly Phe Asp Phe Glu Val Leu Asn Glu 
355 360 365 

Gly Phe Leu Val His Lys Gly Phe Lys Glu Ala Leu Lys Phe His Pro 
370 375 380 

Gin Lys Glu Ala Glu Asn Gin His Asn Lys He Leu Tyr Arg Gin Phe 
385 ' 390 395 400 

Lys Gin Glu Leu Lys Ala Lys Tyr Pro Asn Ser Pro Arg Arg Cys 
405 410 415 



<210> 51 

<211> 557 

<212> DNA 

<213> Homo sapiens 

<400> 51 

caggcaccca ccaacLattc ccgctgggtc aacctgccgg aagagagctt gctgcggccc 60 

gcctacgtgg tacctrqcca gnacccctgg gagccattct acgtggcagg aggcaaggtg 120 

cccaccttcg acgagcgctr tcggcagtac ggcttcaacc gaatcaagcc aggcctgcga 180 

gctgcatgtg qcqqaqtztq atrttgaggt cctgaacgaa ggtttcttgg ttcataaggg 240 

cttcaaagaa gcgrtgaagt tccatcccca aaaggaggct gaaaatcagc acaataagat 300 

cctatatcgc cagttcaaac aqgagttgaa ggccaagtac cccaactctc cccgacgctg 360 

ctgagccctt ccc^ccocza arctgagaag tcagcctctt ggctcctcag gccaccattt 420 

aaggcctgac tggggcaaga aatgtcgctn cactttacag angtagcttg tggtgttgaa 480 

acactggact tggaratggg gtgcttggga atcgattcct aactttacca ctactaactt 540 
gngtggnctt gagtaaa ^^"^ 

<210> 52 

<211> 646 

<212> DNA 

<213> Homo sapiens 

<400> 52 

gggtgtgaac acgcacttat tracacattg tcatcggtag gcacataccc agcccagccc 60 

gttgaaggga cagatctcag gctgtcaccc atggnaacct gcagaccctt ccctcctcca 120 

gaacccttga aggcgaagcc ctgagagaac caggaatttt aggcttctgt tcaagagcta 180 

agaactaaat ttrargccut catctgattt ctttccaaaa agtccatttc attaagtatt 240 

cagacttctt agctccatcc cattcatact ttttgctctc ctactaccca cccaagattg 300 

ttaataataa caataataat aacaacaata atactgcgat aatattaata cttcacattt 360 

gtacgaagct tacagaatgt trtcacatat agcatctcat ctgagcctcc caacagttcc 420 

gtgaggtagg tattcrcacc tcccttttta cagacagggt aaccgaggct nagagaggta 480 

cgggatttac tcaaggccac acagctagtt agtgggtaaa gctaggaatc gatcccagca 540 

cccccatatt caaggtccag gggrttaaac accacagctt ccttntggtn aagtgggagc 600 

gacatttntt accccagtcc ggcctaaaat gggggcctga nggagc 646 

<210> 53 

<211> 121 

<212> PRT 

<213> Homo sapiens 

<400> 53 

Cys Gin Ala Pro Thr Asn Tyr Ser Arg Trp Val Asn Leu Pro Glu Glu 
15 10 15 
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Ser Leu Leu Arq Pro Ala Tyr Val Val Pro Trp Gin Asp Pro Trp Glu 
20 . 25 30 

Pro Phe Tyr Val Ala Gly Gly Lys Val Pro Thr Phe Asp Glu Arg Phe 
35 40 45 

Arg Gin Tyr Gly Phe Asn Arg lie Ser Gin Ala Cys Glu Leu His Val 
50 55 60 

Ala Glv Phe Asp Phe Glu Val Leu Asn Glu Gly Phe Leu Val His Lys 
65 ' 70 75 80 

Gly Phe Lys Glu Ala Leu Lys Phe His Pro Gin Lys Glu Ala Glu Asn 

85 90 95 

Gin His Asn Lys lie Leu Tyr Arg Gin Phe Lys Gin Glu Leu Lys Ala 
100 105 110 

Lys Tvr Pro Asn Ser Pro Arg Arg Cys 
115 120 



<210> 54 
<211> 552 
<212> DNA 
<213> Homo sapiens 

<400> 54 

cctccttcct tgcccctccc tacctccctg cctctacatg gcttctctcc acccctccca 60 

gagtcctacg ggacaggacc ctgctccagt ggtatccaac tcctccctgc ccactcttcc 120 

tcatgggcca cctcacctcc cactttcgat gtctcgcctc ccgtggccac cctgcaatta 

gctttccaag ccccctcccg tggccgtccc ctcccaagac ctctcaccca tgtagcaatc 

cctacatggc tgcctgtcat gtccctactc tctaagccct cctgcccact gttcctccct 

ccccgacatg ctgacaccaa gtggtggaaa ccaccccrca gccccaagcc tgccctgtgc 

agagtrcagc tttgtgttga atgagggggg agagggacaa gtgagggcgg agagagaagr 420 

tcaggaggag gcagggatgc gcanggagca ganagtgaag gaaggaagat ccgaacagat 480 

tgacngaaaa cgttggaccg naaaagttgg tttntcctaa tttttttccg ggagaacccg 540 
ttacacagtt nt 

<210> 55 
<211> 754 
<212> DNA 

<213> Homo sapiens 
<400> 55 

ggaaagtttt tgttttcttt ttcccacaca tttccggggt tggggtgttt tccaagactc 60 

agacacattt gttaacaaag agagaaaaaa aactggggga gcagggagcc cgtgggcaaa 120 

gaagtgaccc cagcagtctg tggacaatgc cttgctccct cttccctgct gaccgcgccc 180 

agcgggtgcc acaggctgct gctcgtggaa tctagagtat ttgtctgraa tatatatctg 240 

catttgcctt tcccctccct cccaccccnn acccctgctc ctccagcagc ttccccatca 300 

atgcacgtcg nccggcggna cacacagaac aggccttccg tcaggcctga gccccttccc 360 

tgggcggcac caaagcaggt gccnntnctg gtgaggggag ttggggcact tgccccagcc 420 

nancanactn acacctgggc cantncggna nnccctntnn cnttcnntcn aaccnattct 480 

ggaancccnn ngggaannaa nnggnancnt annnccnnna tcncnaannn aanaatnanc 540 

nannnncnng nnncnnncnn ncannnccan ncaccccang nnacgnnana tcgnannccc 600 

ttgnnctcaa ancgaancgc cnccacnncc tacagganca nanncnnaac tcagngaaan 660 

tcnaccnrac tnccanncan cncttcccaa ccnnntcatc cttcnttncn ncnatccnnc 720 



180 
240 
300 
360 
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aatataannn cncttataca naactccacn ngnt 754 

<210> 56 
<211> 555 
<212> DNA 

<213> Homo sapiens 
<400> 56 

gggcagccgg gcgagtccca gggcaggggt cczccqccgc cttgcacctg ccctgctggg 60 

cggcaccggg rcagtgccct gccccczccZ gcgggtccca actctctctt tcccancgtg 120 

cgtcctctgg agaagtgcgc gcgtgagctg acatggaccc aaatcctcgg gccgccctgg 180 

agcgccagca gctccgcctt cgggagcggc aaaaattctt cgaggacatt ttacaagcca 240 

gagacagagt ttgtctttcc tctgtcccat ctgcatctcg agccgcagag accccccata 300 

ggtaagtatc tcatccatgg aagtgaatgt ggacacactg gagcaagtag aacttattga 360 

ccttggggac ccggatgcag cagatgtgtt cttgccttgc gaagatcctt caccaacccc 420 

caagtccgtc tgggatggac aaccatttgg aaggaagctg agcctgcccg tgcctacatc 480 

agacaaggac cacattanga acrtcttctt cctcctcctn cgactcttca ncaanctgga 540 

taagccaaat caagt 555 

<210> 57 
<211> 611 
<212> DNA 

<213> Homo sapiens 
<400> 57 

nggcntnttn ttnttagttn nttnttnanc ccggnctttn nnccaaaagg gnnntanggg 60 
ggcncnnngg ntngngnccc ncnttacttg annngtnang ggntnnncat ccntttaaaa 120 
ccncttgnnt gaaaaacLgt tttaaaaaac tncggnangg ttnaggggng ggaanagnnc 180 
taaaaaaagc nggggntttt ngnccaaccn aantttntnt tncctaattn gcaaatcntn 240 
tntcaggggt aanccaaaaa ctggnggnag gnttncnccn ggaaaaantt accnttaaan 300 
cagganaggg ttaaattntn aaaaagggcc ccaattcccc ccatcnttcc caccttnggg 360 
ggccnnctgc nagtaaanag nctggtcttt tccccaanag ggnttttggc tggcccngng 420 
gcccnnattn gggnnnaatn cccccnccgn gggcacaann nttncaagcc agggcccccc 480 
nttggttaaa tttnaagggn nccnagggtt tttgncccct ttnanaaccc ccctttnccc 540 
cnccttttaa aanccnncct ttccccccaa nggnnccttt ttntgncccc aanannnacc 600 
tgggatttgg g 

<210> 58 
<211> 4425 
<212> DNA 

<213> Homo sapiens 
<400> 58 

atggggaaga acaaactcct. tcatccaagt ctggttcttc tcctcttggt cctcctgccc 60 

acagacgect cagtctctgg aaaaccgcag tatatggttc tggtcccctc cctgcrccac 120 

actgagacca ctgagaaggg ctgtgtcctt ctgagctacc tgaatgagac agtgactgta 180 

agtgcttcct tggagtctgt caggggaaac aggagcctct tcactgacct ggaggcggag 240 

aatgacgtac tccactgtgt cgcctrcgct gtcccaaagt cttcatccaa tgaggaggta 300 

atgttcctca ctgtccaagt gaaaggacca acccaagaat ttaagaagcg gaccacagtg 360 

atggttaaga acgaggacag tctggtcttt gtccagacag acaaatcaat ctacaaacca 420 

gggcagacag tgaaatttcg tgttgtctcc atggatgaaa actttcaccc cctgaatgag 480 

ttgatrccac tagtatacat tcaggatccc aaaggaaatc gcatcgcaca atggcagagt 540 

ttccagttag agggtggcct caagcaattt tcttttcccc tctcatcaga gcccrtccag 600 

ggctcctaca aggtggtggt acagaagaaa tcaggtggaa ggacagagca ccctttcacc 660 

gtggaggaat ttgttcttcc caagtttgaa gtacaagtaa cagtgccaaa gataatcacc 720 

atcttggaag aagagatgaa tgtatcagtg tgtggcctat acacatatgg gaagcctgtc 780 

cctggacatg tgactgtgag catttgcaga aagtatagtg acgcttccga ctgccacggt 840 

gaagattcac aggctttctg tgagaaattc agtggacagc taaacagcca tggcizgcttc 900 



wo 00/73328 



PCT/EPOO/05108 



79 

tatcagcaag taaaaaccaa ggtcttccag ctgaagagga aggagtatga aatgaaactt 960 
cacactaaqq cccaaatcca agaagaagga acagtggtgg aattgactgg aaggcagtcc 1020 
agtgaaatca caagaaccat aaccaaactc tcatttgtga aagtggactc acactttcga iU80 
cagggaattc ccttctttgg gcaggtgcgc ctagtagatg ggaaaggcgt ccctatacca 1140 
aataaagtca tattcatcag aggaaatgaa gcaaactatt actccaatgc taccacggat 1200 
gagcatggcc rtgtacagtt ctctatcaac accaccaacg ttatgggtac ctctcttact 1260 
gttagggtca atracaagga tcgtagtccc tgttacggct accagtgggt gtcagaagaa 1320 
cacgaagagg cacatcacac ::gcttatctt gtgttctccc caagcaagag ctttgtccac 1380 
cttgagccca rgi:crcatga actaccctgt ggccatactc agacagtcca ggcacattat 1440 
attctgaatg gaggcaccct gctggggctg aagaagctct cctrttatta tctgataarg 1500 
gcaaagggag gcattgtccg aactgggact catggactgc ttgtgaagca ggaagacatg 1560 
aagggccatL tttccatctc aatccctgtg aagtcagaca ttgctcctgt cgctcggttg 1620 
ctcatctatg ctgttttacc taccggggac gtgattgggg attctgcaaa atatgatgtt 1680 
gaaaattgrc tggccaacaa ggtggatttg agcttcagcc catcacaaag tctcccagcc 1740 
tcacacgccc acctgcgagt cacagcggct cctcagtccg tctgcgccct ccgtgctgtg 1800 
gaccaaagcg tgctgctcat gaagcctgat gctgagctct cggcgtcctc ggtttacaac 1860 
ctgctaccag aaaaggacct cactggcttc cctgggcctt tgaatgacca ggacgatgaa 1920 
gactgcatca atcgtcacaa tgtctatatt aatggaatca catatactcc agtatcaagt 1980 
acaaatgaaa aggatatgta cagcttccta gaggacatgg gcttaaaggc attcaccaac 2040 
tcaaagatrc graaacccaa aatgtgtcca cagcttcaac agtatgaaat gcatggacct 2100 
gaaggtctac qtqtagqttr teatgagtca gatgtaatgg gaagaggcca tgcacgcctg 2160 
gtgcatgtrg aaaaacctca cacggagacc gtacgaaagt acttccctga gacatggarc 2220 
tgggatttgg tggnggraaa c^cagcaggg gtggctgagg taggagtaac agtccctgac 2280 
accatcaccg aqtggaaggc aggggccttc tgcctgtctg aagatgctgg acttggtatc 2340 
tcttccactg cctctctccg agccttccag cccttctttg tggagcrtac aatgccttac 2400 
tctgtgattc gtggagaggc ctccacactc aaggccacgg tcctaaacta ccttcccaaa 2460 
tgcatccggg tcagrgcgca gctggaagcc tctcccgcct tccttgctgt cccagtggag 2520 
aaggaacaag cgcctcactg catctgtgca aacgggcggc aaactgtgtc ctgggcagta 2580 
accccaaagt carraggaaa tgtgaatttc actgtgagcg cagaggcact agagtctcaa 2640 
gagctgtgtg ggactgaggt occttcagtt cctgaacacg gaaggaaaga cacagtcatc 2700 
aagcctctgt tggttgaacc tgaaggacta gagaaggaaa caacattcaa ctccctactt 2760 
tgtccatcag gtggtgaggt t^ctgaagaa ttatccctga aactgccacc aaatgtggta 2820 
gaagaatctg cccgagcttc tgtctcagtt ttgggagaca tattaggctc tgccatgcaa 2880 
aacacacaaa atcttctcca gatgccctat ggctgtggag agcagaatat ggtcctcttt 2940 
gctcctaaca tctatgtact ggattatcta aatgaaacac agcagcttac tccagaggtc 3000 
aagtccaagg ccattggcta tctcaacact ggttaccaga gacagttgaa ctacaaacac 3060 
tatgatggct cctacagcac crttggggag cgatatggca ggaaccaggg caacacctgg 3120 
ctcacagcct ttgttctgaa gacttttgcc caagctcgag cctacatctt catcgatgaa 3180 
gcacacatta cccaagccct catatggctc tcccagaggc agaaggacaa tggctgtttc 3240 
aggagctctg ggtcactgct caacaatgcc ataaagggag gagtagaaga tgaagtgacc 3300 
ctctccgcct atatcaccat cgcccttctg gagattcctc tcacagtcac tcaccctgtt 3360 
gtccgcaatg ccctgttttg cctggagtca gcctggaaga cagcacaaga aggggaccat 3420 
ggcagccatg tatataccaa agcactgctg gcctatgctt ttgccctggc aggtaaccag 3480 
gacaagagga aggaagtact caagtcactt aatgaggaag ctgtgaagaa agacaactct 3540 
gtccattggg agcgccctca gaaacccaag gcaccagtgg ggcattttta cgaaccccag 3600 
gctccctctg ctgaggtgga gatgacatcc tatgtgctcc tcgcttatct cacggcccag 3660 
ccagccccaa cctcggagga cctgacctct gcaaccaaca tcgtgaagtg gatcacgaag 3720 
cagcagaatg cccagggcgg tttctcctcc acccaggaca cagtggrggc tctccatgct 3780 
ctgtccaaat atggagccgc cacatttacc aggactggga aggctgcaca ggtgactatc 3840 
cagtcttcag ggacattttc cagcaaattc caagtggaca acaacaatcg cctgttactg 3900 
cagcaggtct cattgccaga gctgcctggg gaatacagca tgaaagtgac aggagaagga 3960 
tgtgtctacc tccagacctc cttgaaatac aatattctcc cagaaaagga agagttcccc 4020 
tttgctttag gagtgcagac tctgcctcaa acttgtgatg aacccaaagc ccacaccagc 4080 
ttccaaatct ccctaagtgt cagttacaca gggagccgct ctgcctccaa catggcgatc 4140 
gttgatgtga agatggtctc rggcttcatt cccctgaagc caacagtgaa aatgcttgaa 4200 
agatctaacc atgtgagccg gacagaagtc agcagcaacc atgtcrtgat ttaccttgat 4260 
aaggtgtcaa atcagacact gagcttgttc ttcacggttc tgcaagatgt cccagtaaga 4320 
gatctcaaac cagccatagt gaaagtctat gattactacg agacggatga gtttgcaatc 4380 
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gctgagnaca acgctccttg cagcaaagat cttggaaai:g cttga 4425 

<210> 59 
<211> 1474 
<212> PRT 

<213> Homo sapiens 

^ r, 5 9 

Lvs Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
; ' ' "5 10 15 

1-u Leu Pro Thr Asp Ala Ser Vai Ser Gly Lys Pro Gin Tyr Met 
20 25 30 



Vai Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

■ ^ ^--^ Leu Ser- Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 

55 60 

' ^ - Val Ara Glv Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
^ ^ 70 75 80 

Veil Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 
85 90 95 

Asn Glu Glu Val Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin 
100 105 110 

Glu Pne Lys Lvs Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 ^ 120 125 

Val Phe Val Gin Thr Asp Lvs Ser lie Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Ph- Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 160 

Leu He Pro Leu Vai Tyr He Gin Asp Pro Lys Gly Asn Arg He Ala 

165 170 175 

Gin Tro Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 
180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lvs Lvs Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 
245 250 255 

Gly Lys Pro Vai Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr 
260 265 270 
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Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Giu 
275 280 285 

Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 
290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lvs Glu Tyr Glu Mer Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 

325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 
340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 360 365 

Val Arg Leu Val Asd Glv Lvs Gly Val Pro He Pro Asn Lys Val He 
370 ^ ^ 375 380 

Phe He Arg Glv Asn Giu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 ^ 390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 

405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 
420 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
465 470 475 480 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 

485 490 495 

Tyr Leu He Met Ala Lvs Gly Gly He Val Arg Thr Gly Thr His Gly 
500 ^ 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 
515 520 525 

Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 
530 535 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 

565 570 575 
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Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 
580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
' 595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys lie Asn Arg His Asn Val Tyr lie Asn Gly lie Thr Tyr Thr 

645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 
660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys lie Arg Lys Pro Lys Met 
675 680 685 

Cvs Pro Gin Leu Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg 
690 695 700 

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu 
705 710 715 720 

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro 

725 730 735 

Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala 
740 745 750 

Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly 
755 760 765 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly He Ser Ser Thr Ala 
770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 
785 790 795 800 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 

805 810 815 

Tyr Leu Pro Lvs Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro 
820 825 830 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 
835 840 845 

Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 
850 855 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 870 875 880 



Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 
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885 890 895 

Asp Thr Val lie Lys Pro Leu Leu Val Glu Pro Giu Gly Leu Glu Ly^ 
900 905 910 

Glu Thr Tnr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Giu Val Ser 
915 920 925 

Glu Glu Leu Ser Leu Lvs Leu Pro Pro Asn Val Vai Giu Glu Ser Ala 
930 935 940 

Arg Se^ Vai Ser Vai Leu Giy Asp lie Leu Gly Ser Ala Met Gin 

945 950 955 960 

Asn T-r Gin Asn Leu Leu Gin Mer Pro Tyr Giy Cys Giy Giu Gin Asn 
965 970 975 

Met Val Leu Phe Ala Pro Asn lie Tyr Val Leu Asp Tyr Leu Asn Glu 
980 985 990 

Thr Gin Gin Leu Thr Pro Glu Val Lys Ser Lys Aia lie Gly Tyr Leu 
995 1000 i005 

Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Giy Ser 
1010 1015 i020 

Tyr Ser Thr Phe Gly Glu Arg Tyr Giy Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Vai Leu Lys Thr Phe Aia Gin Aia Arg Aia Tyr lie 
1045 1050 1055 

Phe I^e Asp Glu Aia His lie Thr Gin Aia Leu lie Trp Leu Ser Gin 
1060 1065 1070 

Arg Gin Lvs Asp Asn Glv Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 
1075 1080 1085 

Asn Ala lie Lys Gly Gly Val Giu Asp Giu Val Thr Leu Ser Aia Tyr 
1090 1095 1100 

lie Thr He Aia Leu Leu Glu lie Pro Leu Thr Vai Thr His Pro Vai 
1105 1110 1115 1120 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 
1125 1130 1135 

Glu Glv Asp His Giy Ser His Vai Tyr Thr Lys Aia Leu Leu Aia Tyr 
1140 11^5 1150 

Ala Phe Aia Leu Ala Gly Asn Gin Asp Lys Arg Lys Giu Vai Leu Lys 
1155 1160 1165 

Ser Leu Asn Giu Giu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 
1170 1175 1180 

Arg Pro Gin Lys Pro Lys Aia Pro Vai Giy His Phe Tyr Giu Pro Gin 
1185 1190 1195 1200 
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Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 
1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 
1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 
1235 1240 1245 



Ser Ser Thr Gin Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 
1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 
1285 1290 1295 

Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 
1300 1305 1310 



Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 
1315 1320 1325 

Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 
1330 1335 1340 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 1360 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 
1365 1370 1375 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu 
1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 
1395 1400 1405 

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 
1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 
1445 1450 1455 

Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 1470 

Asn Ala 



<210> 60 
<211> 722 
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<213> Homo sapiens 



<400> 60 

ngaccnnaaa aagngtttat tcatcaagtn tttaaagatn caaaaacacg tgtnttntgn 60 

ggagctntga naacaggact ccagcaaagc acttttcagc cttggggnct tcaagcatrt 120 

ccaagacctt tgctgcaagg agcattgnnc tcagcaattg caaactcarc cttnrcgtag 1^0 

naancaraga ctttcactat ggctggtttc ananctntta ctgggacatt ttgcanaacc 240 

gngaanaaca agctcagggn crgatttgac accttntcaa ggtaaatcaa gacarggtcg 300 

ctgntgactt ctgncccggn tcacatgggt tagatctttc aagcnttttt nactgnnngg 360 

cttcagggga atgaaacccc gagaccntnr tnncaatnaa cgacncccnt nttgggaggc 420 

aaaccggntc cctgngtaac ctnnccctta gggganattt ggaaanctng gtgtgggncn 480 

tttgggttca tnnnnaaggt ttngaggcna agnnctgnct tcnnaaagca aaggggnacc 54 0 

tnttccnttt ttntggnnaa antttgnttt ttcaaggnat tnnngaagnt annnncaacc 600 

ttctcccggn nntttcaang cnggntttcc cagggnagtt ttggnatagn nccnnttnna 660 

aaanncgggg ggttttttac ncccttggaa ttntnnggga aaaanncctn aaannnggga 720 

ac ^22 



<210> 61 
<211> 557 
<212> DNA 

<213> Homo sapiens 



<400> 61 

gtgtgcagtg cccccttgcc tcttcaaacc tacagcttct ctttgccatt tgtggatttc 60 

acatcactcc acacagaaac attacagcct ggcatcccca gtctttgcct tcttccagct 120 

gcctcgacac agcactgtgg cctgtcccta ttgcccaggc acgccatttc caagggcagg 180 

aaggggcagt gtcctgaagc ccatcttttc tgtgactgtc ttaggtgatg tgtagccccc 240 

tccacctttc cactcaacaa cctcccaccc ctgtcctgct gcatggtccg gagtctggga 300 

cctactttgt tttttgttat ttatgacctt gtttaaagaa aataaatatc tcccaacctt 360 

taaaaaaaaa aaaaaaaaaa aactcgagag atctatgaat cgaagatact gaaaaacccc 420 

gcangttcac ttcaactgtg catcgtgcan catctcaatt ctttcatttn atacatccnt 480 

tttgcctttc tttatgtaac tatactcctc taaagtttca atcttgggca ttnaaccttt 540 

gatctataaa attttta 5 57 



<210> 62 

<211> 640 

<212> DNA 

<2 1 3> Homo sapiens 



<400> 62 

ttaaaggttg ggagatattt attttcttta 
aggtcccaga ctccggacca tgcagcagga 
ggagggggct acacatcacc taagacagtc 
cttcctgccc ttggaaatgg cgtgcctggg 
ggcagctgga agaaggcaaa gactggggat 
tgtgaaatcc acaaatggca aagagaagct 
cacgtcgacg cggccgcgaa ttcggatccc 
gctagcgtaa tctggaacat cgtatgggta 
gtggggtatc ttcatcatcg aatagatagt 
catccctgta gtgattccaa acgcgttata 
nggtttgaat tattaccatc tcaattttac 



aacaaggtca taaataacaa aaaacaaagt 60 

caggggtggg aggttgttga gtggaaaggt 120 

acagaaaaga tgggcttcag gacactgccc 180 

caatagggac aggccacagt gctgtgtcga 240 

gccaggctgt aatgtttctg tgtggagtga 300 

gtaggtttga agaggcaagg gggcactgca 360 

cggggcctcc atggccatat gaccacccaa 420 

aagccataga gatctctttt tttgggtttg 480 

tatatacata tccattgtag tgggattaaa 540 

cgcagtttgg tccgtccaac caggtgacag 600 
tagccgggat 640 



<210> 63 

<211> 566 

<212> DNA 

<213> Homo sapiens 
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<400> 63 

ctcccccatc cgcatcctcc tcctcctgtc ccctcrtcct ctctgtccct caccaagcca 60 

tcccctcttt gcctctcccc atccctcgct ggcctgctcc tgctcccctt ctcccatcct 120 

cccczccccc gtctcrgccc agccagcccc ctcccgactc cccagtttca tcggactccc 180 

tggccccatc ccgtccccgc cctggcccct ttgtgcccct tctcatcgtt ttctccctcc 240 

ttcccgggct tggcgtccct tctcccccct aactccttcc ctcggcctcc ctgcccctcc 300 

acggcccgcc tgcctccctt gcccaagtcc tgagccacca rgctgacccc gatggtggcc 360 

cggnggggcg gtgrccccgg actcttctct ntncaagaac acgcttcagc cggctgcccc 420 

aagctacgct gggaggaggc cgacgcagcn ttgcctnagc caggcctggt ggtcctttgn 480 

ccagncatca tggccttcan tggcggtnca tnngnttcac ctnctggnag cacntatrga 540 

taccaanatg ggtggccnta tnncta 566 

<210> 64 
<211> 648 
<212> DNA 

<213> Homo sapiens 
<400> 64 

tggcgttcac gcagatcgtc trttattagc ggtctgtaaa gcaccnccca gggtcccccg 60 

accccagatt ggaggaagcc ttgagaggtc agtagcacca gggacarggc agggcccgag 120 

ggcgcgatgt gcagccgatg gtgagggact gggcgccctc gcctgccccc ggggttgtca 180 

gcactgggaa ggcttggggg tagcagccac ctccttcccc caacccaaca gactagtcca 240 

aatttgggta aataaataaa ataaataaga ttcctcaagc tggcctaccc tggagaggag 300 

ccgtggttgc agccggccac tcgggaggcc cgagggccag cgggggttag ttggggcgtc 360 

ctctcctctc gggtgatggg gagccctggg ggatggcagc ataggggctg ggatggcctt 420 

ggcagaggcc gtctncccac attctgactc cttggtcccc cttgaaaccc tgttggtgtc 480 

ccttcccaca aagcccttct tgccctcagt gggtggggaa ggccggtgcc cccttccctt 540 

cttcancgca aagggtntgc aggaaagggg caaaattagg gggnaaaaag gtcccttttt 600 

tcanccccnt tngtccccna aaagatgggg ccttttccnt ttngnggt 648 

<210> 65 
<211> 2274 
<212> PRT 
<213> Mus sp. 

<400> 65 

Met Thr Ser Ser Met Ala Ser Tyr Glu Gin Leu Val Arg Gin Val Glu 
15 10 15 

Ala Leu Lys Ala Glu Asn Thr His Leu Arg Gin Glu Leu Arg Asp Asn 
20 25 30 

Ser Ser His Leu Ser Lys Leu Glu Thr Glu Thr Ser Gly Met Lys Glu 
35 ' 40 45 

Val Leu Lys His Leu Gin Gly Lys Leu Glu Gin Glu Ala Arg Val Leu 
50 55 60 

Val Ser Ser Gly Gin Thr Glu Val Leu Glu Gin Leu Lys Ala Leu Gin 
65 70 75 80 

Thr AsD lie Ser Ser Leu Tyr Asn Leu Lys Phe His Ala Pro Ala Leu 

85 90 95 

Gly Pro Glu Pro Ala Ala Arg Thr Pro Glu Gly Ser Pro Val His Gly 
100 105 110 

Ser Gly Pro Ser Lys Asp Ser Phe Gly Glu Leu Ser Arg Ala Thr lie 
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115 120 125 

Arg Leu Leu Glu Glu Leu Asp Gin Glu Arg Cys Phe Leu Leu Ser Giu 
130 135 140 

lie Glu Lys Glu Glu Lys Glu Lys Leu Trp Tyr Tyr Ser Gin Leu Gin 
145 150 155 160 

Gly Leu Ser Lys Arg Leu Asp Glu Leu Pro His Val Asp Thr Phe Ser 

165 170 175 

Met Gin Met Asp Leu lie Arg Gin Gin Leu Glu Phe Giu Ala Gin His 
180 185 190 

lie Arg Ser Leu Met Giu Giu Arg Phe Gly Thr Ser Asp Giu Met Vai 
195 200 205 

Gin Arg Ala Gin lie Arg Ala Ser Arg Leu Giu Gin lie Asp Lys Glu 
210 215 220 

Leu Leu Giu Ala Gin Asd Arg Val Gin Gin Thr Giu Pro Gin Ala Leu 
225 230 235 240 

Leu Ala Val Lys Pro Val Aia Vai Glu Glu Glu Gin Giu Aia Giu Vai 

245 250 255 

Pro Thr His Pro Giu Asp Gly Thr Pro Gin Pro Giy Asn Ser Lys Vai 
260 265 270 

Giu Vai Vai Phe Trp Leu Leu Ser Met Leu Aia Thr Arg Asp Gin Giu 
275 280 285 

Asp Thr Aia Arg Thr Leu Leu Aia Met Ser Ser Ser Pro Giu Ser Cys 
290 295 300 

Val Aia Met Arg Arg Ser Giy Cys Leu Pro Leu Leu Leu Gin lie Leu 
305 310 315 320 

His Giy Thr Giu Aia Glv Ser Vai Giy Arg Aia Giy lie Pro Gly Ala 

325 330 335 

Pro Gly Aia Lys Asp Ala Arg Met Arg Aia Asn Ala Ala Leu His Asn 
340 345 350 

lie Vai Phe Ser Gin Pro Asp Gin Giy Leu Aia Arg Lys Giu Met Arg 
355 360 365 

Vai Leu His Vai Leu Giu Gin lie Arg Ala Tyr Cys Giu Thr Cys Trp 
370 375 380 

Asp Trp Leu Gin Aia Arg Asp Ser Giy Thr Giu Thr Pro Val Pro lie 
385 390 395 400 

Giu Pro Gin lie Cys Gin Ala Thr Cys Ala Val Met Lys Leu Ser Phe 

405 410 415 

Asp Glu Glu Tyr Arg Arg Aia Met Asn Giu Leu Gly Gly Leu Gin Aia 
420 425 430 
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Val Ala Giu Leu Leu Gin Val Asp Tyr Glu Met His Lys Met Thr Arg 
435 440 445 

Asp Pro Leu Asn Leu Ala Leu Arg Arg Tyr Ala Gly Met Thr Leu Thr 
450 455 460 

Asn Leu Thr Pne Gly Asp Val Ala Asn Lys Ala Thr Leu Cys Ala Arg 
465 ^ 470 475 480 

Arg Gly Cys Met Glu Ala lie Val Ala Gin Leu Gly Ser Glu Ser Glu 
485 490 495 

Glu Leu His Gin Val Val Ser Ser lie Leu Arg Asn Leu Ser Trp Arg 
500 505 510 

Ala Asp lie Asn Ser Lys Lys Val Leu Arg Glu Val Gly Ser Met Thr 
515 520 525 

Ala Leu Mez Glu Cys Val Leu Arg Ala Ser Lys Glu Ser Thr Leu Lys 
530 ' 535 540 

Ser Val Leu Ser Ala Leu Trp Asn Leu Ser Ala His Ser Thr Glu Asn 
545 550 555 560 

Lys Ala Ala lie Cys Gin Val Asp Gly Ala Leu Gly Phe Leu Val Ser 
565 570 575 

Thr Leu Thr Tyr Arg Cvs Gin Gly Asn Ser Leu Ala Val lie Glu Ser 
580 ' 585 590 

Gly Gly Gly lie Leu Arg Asn Val Ser Ser Leu lie Ala Thr Arg Glu 
595 ' 600 605 

Asp Tyr Arg Gin Val Leu Arg Asp His Asn Cys Leu Gin Thr Leu Leu 
610 615 620 

Gin His Leu Thr Ser His Ser Leu Thr lie Val Ser Asn Ala Cys Gly 
625 630 635 640 

Thr Leu Trp Asn Leu Ser Ala Arg Ser Pro Arg Asp Gin Glu Leu Leu 
645 650 655 

Trp Asp Leu Gly Ala Val Gly Met Leu Arg Asn Leu Val His Ser Lys 
660 665 670 

His Lys Met lie A.ia Met Gly Ser Ala Ala Ala Leu Arg Asn Leu Leu 
675 680 685 

Ala His Arg Pro Ala Lys Tyr Gin Ala Ala Ala Met Ala Val Ser Pro 
690 ^ 695 700 

Gly Thr Cys Val Pro Ser Leu Tyr Val Arg Lys Gin Arg Ala Leu Glu 
705 710 715 720 

Ala Glu Leu Asp Thr Arg His Leu Val His Ala Leu Gly His Leu Glu 
725 730 735 
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Lys G""^ Ser Leu Pro Glu Ala Glu Thr Thr Ser Lys Lys Pro Leu Pro 
740 "745 750 

Pro Leu Arg His Leu Asp Gly Leu Val Gin Asp Tyr Ala Ser Asp Ser 
755 "760 765 

Gly Cys Phe Asp Asp Asd Asp Ala Pro Ser Leu Ala Ala Ala Ala Thr 
770 775 780 

Thr Ala Glu Pro Ala Ser Pro Ala Val Met Ser Met Phe Leu Gly Gly 
785 790 795 800 

Pro Phe Leu Gin Gly Gin Ala Leu Ala Arg Thr Pro Pro Ala Arg Gin 

805 810 815 

Gly Gly Leu Glu Ala Glu Lys Glu Ala Gly Gly Glu Ala Ala Val Ala 
820 825 830 

Ala Lys Ala Lys Ala Lys Leu Ala Leu Ala Val Ala Arg lie Asp Arg 
835 840 845 

Leu Val Glu Asd He Ser Ala Leu His Thr Ser Ser Asp Asp Ser Phe 
850 ' 855 860 

Ser Leu Ser Ser Gly Asp Pro Gly Gin Glu Ala Pro Arg Glu Gly Arg 
865 870 875 880 

Ala Gin Ser Cys Ser Pro Cys Arg Gly Thr Glu Gly Gly Arg Arg Glu 

885 890 895 

Ala Gly Ser Arg Ala His Pro Leu Leu Arg Leu Lys Ala Ala His Thr 
900 905 910 

Ser Leu Ser Asn Asp Ser Leu Asn Ser Gly Ser Thr Ser Asp Gly Tyr 
915 - 920 925 

Cys Thr Arg Glu His Met Thr Pro Cys Pro Leu Ala Ala Leu Ala Glu 
930 935 940 

His Arg Asd Asp Pro Val Arg Gly Gin Thr Arg Pro Arg Arg Leu Asp 
945 ' 950 955 960 

Leu Asp Leu Pro Ser Arg Ala Glu Leu Pro Ala Arg Asp Thr Ala Ala 

965 970 975 

Thr Asp Ala Arg Val Arg Thr He Lys Leu Ser Pro Thr Tyr Gin His 
980 985 990 

val Pro Leu Leu Asp Gly Ala Ala Gly Ala Gly Val Arg Pro Leu Val 
995 1000 1005 

Gly Pro Gly Thr Ser Pro Gly Ala Arg Lys Gin Ala Trp He Pro Ala 
1010 1015 1020 

Asp Ser Leu Ser Lys Val Pro Glu Lys Leu Val Ala Ser Pro Leu Pro 
1025 1030 1035 1040 

He Ala Ser Lys Val Leu Gin Lys Leu Val Ala Gin Asp Gly Pro Met 
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90 

1050 



1055 



Ser Leu Ser Arg Cys Ser Ser Leu Ser Ser Leu Ser Ser Thr Gly His 
1060 1065 1070 

Ala Val Pro Ser Gin Ala Glu Asn Leu Asp Ser Asp Ser Ser Leu Glu 
1075 1080 1085 

Gly Leu Glu Glu Ala Gly Pro Gly Glu Ala Glu Leu Gly Arg Ala Trp 
1090 1095 1100 

Arg Ala Ser Gly Ser Thr Ser Leu Pro Val Ser lie Pro Ala Pro Gin 
1105 1110 1115 1120 

Arg Gly Arg Ser Arg Gly Leu Gly Val Glu Asp Ala Thr Pro Ser Ser 
1125 1130 1135 

Ser Ser Glu Asn Cys Val Gin Glu Thr Pro Leu Val Leu Ser Arg Cys 
1140 1145 1150 

Ser Ser Val Ser Ser Leu Gly Ser Phe Glu Ser Arg Ser He Ala Ser 
1155 1160 1165 

Ser He Pro Ser Asp Pro Cys Ser Gly Leu Gly Ser Gly Thr Val Ser 
ll^O 1175 1180 

Pro Ser Glu Leu Pro Asp Ser Pro Gly Gin Thr Met Pro Pro Ser Arg 
1185 1190 1195 1200 

Ser Lys Thr Pro Pro Ala Pro Pro Gly Gin Pro Glu Thr Ser Gin Phe 
1205 1210 1215 

Ser Leu Gin Trp Glu Ser Tyr Val Lys Arg Phe Leu Asp He Ala Asp 
1220 1225 1230 

Cys Arg Glu Arg Cys Gin Pro Pro Ser Glu Leu Asp Ala Gly Ser Val 
1235 1240 1245 

Arg Phe Thr Val Glu Lys Pro Asp Glu Asn Phe Ser Cys Ala Ser Ser 
1250 1255 1260 

Leu Ser Ala Leu Ala Leu His Glu Leu Tyr Val Gin Gin Asp Val Glu 
1265 1270 1275 1280 

Leu Arg Leu Arg Pro Pro Ala Cys Pro Glu Arg Ala Val Gly Gly Gly 
1285 1290 1295 

Gly His Arg Arg Arg Asp Glu Ala Ala Ser Arg Leu Asp Gly Pro Ala 
1300 1305 1310 

Pro Ala Gly Ser Arg Ala Arg Ser Ala Thr Asp Lys Glu Leu Glu Ala 
1315 1320 1325 

Leu Arg Glu Cys Leu Gly Ala Ala Met Pro Ala Arg Leu Arg Lys Val 
1330 1335 1340 

Ala Ser Ala Leu Val Pro Gly Arg Arg Ser Leu Pro Val Pro Val Tyr 



1345 



1350 



1355 



1360 



wo 00/73328 



PCT/EPOO/05108 



91 

Met Leu Val Pro Ala Pro Ala Arg Giy A^^p A^p St^r Gly Thr A^p 3er 
1365 1370 1375 

Ala Glu Gly Thr Pro Val Asn Phe Ser Ser Ala Ala Ser Leu Ser Asp 
1380 1385 1390 

Glu Thr Leu Gin Gly Pro Ser Arg Asp Lys Pro Ala Gly Pro Gly Asp 
1395 1400 1405 

Arg Gin Lys Pro Thr Gly Arg Ala Ala Pro Ala Arg Gin Thr Arg Ser 
1410 ^ 1415 1420 

His Arg Pro Lys Ala Ala Gly Ala Gly Lys Ser Thr Glu His Thr Arg 
1425 1430 1435 1440 

Gly Pro Cys Arg Asn Arg Ala Gly Leu Glu Leu Pro Leu Ser Arg Pro 
1445 1450 1455 

Gin Ser Ala Arg Ser Asn Arg Asp Ser Ser Cys Gin Thr Arg Thr Arg 
1460 1465 1470 

Gly Asp Gly Ala Leu Gin Ser Leu Cys Leu Thr Thr Pro Thr Glu Glu 
1475 1480 1485 

Ala Val Tyr Cys Phe Tyr Asp Ser Asp Glu Glu Pro Pro Ala Thr Ala 
1490 ' 1495 1500 

Pro Pro Pro Arg Arg Ala Ser Ala lie Pro Arg Ala Leu Lys Arg Glu 
1505 1510 1515 1520 

Lys Pro Ala Gly Arg Lys Glu Thr Pro Ser Arg Ala Ala Gin Pro Ala 
1525 1530 1535 

Thr Leu Pro Val Arg Ala Gin Pro Arg Leu lie Val Asp Glu Thr Pro 
1540 1545 1550 

Pro Cys Tyr Ser Leu Thr Ser Ser Ala Ser Ser Leu Ser Glu Pro Glu 
1555 1560 1565 

Ala Pro Glu Gin Pro Ala Asn His Ala Arg Gly Pro Glu Gin Gly Ser 
1570 1575 1580 

Lys Gin Asp Ser Ser Pro Ser Pro Arg Ala Glu Glu Glu Leu Leu Gin 
1585 1590 1595 1600 

Arg Cys lie Ser Leu Ala Met Pro Arg Arg Arg Thr Gin Val Pro Gly 
1605 1610 1615 

Ser Arg Arg Arg Lys Pro Arg Ala Leu Arg Ser Asp lie Arg Pro Thr 
1620 1625 1630 

Glu lie Thr Gin Lys Cys Gin Glu Glu Val Ala Gly Ser Asp Pro Ala 
1635 1640 1645 



Ser Asp Leu Asp Ser Val Glu Trp Gin Ala lie Gin Glu Gly Ala Asn 
1650 1655 1660 
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Ser lie Val Thr Trp Leu His Gin Ala Ala Ala Lys Ala Ser Leu Glu 

1665 1670 1675 1680 

Ala Ser Ser Glu Ser Asp Ser Leu Leu Ser Leu Val Ser Glv Val Ser 
1685 1690 "^1695 

Ala Gly Ser Thr Leu Gin Pro Ser Lys Leu Arg Lvs Gly Arg Lys Pro 
1700 1705 ^ 1710 

Ala Ala Glu Ala Gly Gly Ala Trp Arg Pro Glu Lys Arg Gly Thr Thr 
1715 1720 1725 

Ser Thr Lys lie Asn Gly Ser Pro Arg Leu Pro Asn Gly Pro Glu Lys 
1730 1735 1740 

Ala Lys Gly Thr Gin Lys Met Met Ala Gly Glu Ser Thr Met Leu Arg 

1745 1750 1755 1760 

Gly Arg Thr Val He Tyr Ser Ala Gly Pro Ala Ser Arg Thr Gin Ser 
1765 1770 1775 

Lys Gly He Ser Gly Pro Cys Thr Thr Pro Lys Lys Thr Gly Thr Ser 
1780 1785 1790 

Gly Thr Thr Gin Pro Glu Thr Val Thr Lys Ala Pro Ser Pro Glu Gin 
1795 1800 1805 

Gin Arg Ser Arg Ser Leu His Arg Pro Gly Lys He Ser Glu Leu Ala 
1810 1815 1820 

Ala Leu Arg His Pro Pro Arg Ser Ala Thr Pro Pro Ala Arg Leu Ala 

1825 1830 1835 1840 

Lys Thr Pro Ser Ser Ser Ser Ser Gin Thr Ser Pro Ala Ser Gin Pro 
1845 1850 1855 

Leu Pro Arg Arg Ser Pro Leu Ala Thr Pro Thr Gly Gly Pro Leu Pro 
1860 1865 1870 

Gly Pro Gly Gly Ser Leu Val Pro Lys Ser Pro Ala Arg Ala Leu Leu 
1875 1880 1885 

Ala Lys Gin His Lys Thr Gin Lys Ser Pro Val Arg He Pro Phe Met 
1890 1895 1900 

Gin Arg Pro Ala Arg Arg Val Pro Pro Pro Leu Ala Arg Pro Ser Pro 

1905 1910 1915 1920 

Glu Pro Gly Ser Arg Gly Arg Ala Gly Ala Glu Gly Thr Pro Gly Ala 
1925 1930 1935 

Arg Gly Ser Arg Leu Gly Leu Val Arg Met Ala Ser Ala Arg Ser Ser 
1940 1945 1950 

Gly Ser Glu Ser Ser Asp Arg Ser Gly Phe Arg Arg Gin Leu Thr Phe 
1955 1960 ' 1965 

He Lys Glu Ser Pro Gly Leu Leu Arg Arg Arg Arg Ser Glu Leu Ser 
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1970 1975 1980 

Ser Ala Asp Ser Thr Ala Ser Thr Ser Gin Ala Ala Ser Pro Arg Arg 
19S5 ' 1990 1995 2000 

Glv Arg Pro Ala Leu Pro Ala Val Pne Leu Cys Ser Ser Arg Cys Asp 
2005 2010 2015 

::.u Leu Arg Val Ser Pro Arg Gin Pro Leu Ala Ala Gin Arg Ser Pro 
2020 2025 2030 

Z^T\ Ala Lys Pro Gly Leu Ala Pro Leu Ala Pro Arg Arg Thr Ser Ser 
2035 2040 2045 

L.u Sv r Pro Ser Arg Leu Pro Val Arg Ala Ser Pro Gly Arg Pro Glu 
:05j 2055 ' 2060 

Tr.r V;., Lvs Arg Tyr Ala Ser Leu Pro His lie Ser Val Ser Arg Arg 
:.OcL ^ 2070 2075 2080 

::-r Azr Ser Ala Val Ser Val Pro Thr Thr Gin Ala Asn Ala Thr Arg 
2085 2090 2095 

Ara Glv Ser Asp Gly Glu Ala Arg Pro Leu Pro Arg Val Ala Pro Pro 
2100 2105 2110 

Giy Tnr Thr Trp Arg Arg lie Lys Asp Glu Asp Val Pro His lie Leu 
2115 2120 2125 

Arg Ser Thr Leu Pro Ala Thr Ala Leu Pro Leu Arg Val Ser Ser Pro 
2130 2135 2140 

Glu Aso Ser Pro Ala Gly Thr Pro Gin Arg Lys Thr Ser Asp Ala Val 
2145 ' 2150 2155 2160 

Val Gin Thr Glu Asp Val Ala Thr Ser Lys Thr Asn Ser Ser Thr Ser 
2165 2170 2175 

Pro Ser Leu Glu Ser Arg Asp Pro Pro Gin Ala Pro Ala Ser Gly Pro 
2180 2185 2190 

Val Ala Pro Gin Gly Ser Asp Val Asp Gly Pro Val Leu Thr Lys Pro 
2195 2200 2205 

Pro Ala Ser Ala Pro Phe Pro His Glu Gly Leu Ser Ala Val lie Ala 
2210 2215 2220 

Gly Phe Pro Thr Ser Arg His Gly Ser Pro Ser Arg Ala Ala Arg Val 
2225 2230 2235 2240 

Pro Pro Phe Asn Tyr Val Pro Ser Pro Met Ala Ala Ala Thr Met Ala 
2245 2250 2255 

Ser Asp Ser Ala Val Glu Lys Ala Pro Val Ser Ser Pro Ala Ser Leu 
2260 2265 2270 



Leu Glu 
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<210> 66 
<211> 942 
<212> DNA 

<213> Caenorhabdi tis elegans 
<400> 66 

atgtacgaag atcagractg cgtggggtat aaT:t:ttctgg aaggaacagg agactttcga 60 

gaagacggtc tagaacctat caccctggca tcccatgcca cttcagatat gatgttaact 120 

ctggaaaaaa tgacctcaat grgggatgga cctatatcag ttgggatatt tattgatttt 180 

cactctagtc aagctctgga gtatctcgca gaagtgcaca gatgtgatga ggagttcagg 240 

aagaagatga caattcactt tgcaatccgt cagtcagcat tccaacaaac ttgcccaaaa 300 

attcaaattc cagcttcaga cagaacttgc tggaagttca gagcggatca atcctacctc 360 

cgaagccatc tgtcaggacc cttccaacta tatccgagca accttatgag aaatttggct 420 

cgccagggag ccaagtcgga tattcatttt attatggatg cagatatgat tgttagtgag 480 

ggattcgcgc ggaaactcaa aaaagtggca aatgagatga tcgacggaaa aagtaaaaaa 540 

gtattggcaa ttcgaagatt cgaatcggtt aai:ggaactt atttgcctag aactcacttt 600 

gagttgaagc aatctanggc ttattccaac ggatatgaat gggaagttca agtaattctt 660 

caccgtaacg acccctataa cgcggcctat ttcccgtcca gaatcaaagt tatgcactct 720 

ctgatttacg ccctttgtcg tgctggttac acgtttcatg ttccgtcaca cgttttcgat 780 

gttcatgaag ggattaagca tactaataca atttattcga aagccacaat tgctcatcag 840 

gaagcttatg cgatggacat agccggagcc agatatgtca gagaaatgga cgaaaagtac 900 

ccggacactt tggacaagtg tggaagattc aagatgtatt ag 942 

<210> 67 
<211> 313 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 67 

Met Tyr Glu Asp Gin Tyr Cys Val Gly Tyr Asn Phe Leu Glu Gly Thr 
15 10 15 

Gly Asp Phe Arg Glu Asp Gly Leu Glu Pro lie Thr Leu Ala Ser His 
20 25 30 

Ala Thr Ser Asp Met Met Leu Thr Leu Glu Lys Met Thr Ser Met Trp 
35 40 45 

Asp Gly Pro He Ser Val Gly He Phe He Asp Phe His Ser Ser Gin 
50 55 60 

Ala Leu Glu Tyr Leu Ala Glu Val His Arg Cys Asp Glu Glu Phe Arg 
65 70 75 80 

Lys Lys Met Thr He His Phe Ala He Arg Gin Ser Ala Phe Gin Gin 

85 90 95 

Thr Cys Pro Lys He Gin He Pro Ala Ser Asp Arg Thr Cys Trp Lys 
100 105 110 

Phe Arg Ala Asp Gin Ser Tyr Leu Arg Ser His Leu Ser Gly Pro Phe 
115 120 125 



Gin Leu Tyr Pro Ser Asn Leu Met Arg Asn Leu Ala Arg Gin Gly Ala 
130 135 140 
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Lys Ser Asp lie His Phe lie Met Asp Ala Asp Met lie Val Ser Glu 
145 150 155 160 

Gly Phe Ala Arg Lys Leu Lvs Lys Val Ala Asn Glu Met lie Asp Gly 

165 ^ 170 175 

Lys Ser Lvs Lys Val Leu Ala lie Arg Arg Phe Glu Ser Val Asn Gly 
180 185 190 

Thr Tyr Leu Pro Arg Thr His Phe Glu Leu Lys Gin Ser Met Ala Tyr 
195 200 205 

Ser Asn Gly Tyr Glu Trp Glu Val Gin Val lie Leu His Arg Asn Asp 
210 215 220 

Pro Tyr Asn Ala Ala Tyr Phe Pro Ser Arg lie Lys Val Met His Ser 
225 230 235 240 

Leu lie Tyr Ala Leu Cys Arg Ala Gly Tyr Thr Phe His Val Pro Ser 

245 250 255 

His Val Pne Asp Val His Glu Gly He Lys His Thr Asn Thr He Tyr 
260 265 270 

Ser Lys Ala Thr He Ala His Gin Glu Ala Tyr Ala Met Asp He Ala 
275 280 285 

Gly Ala Arg Tyr Val Arg Glu Met Asp Glu Lys Tyr Pro Asp Thr Leu 
290 295 300 

Asp Lys Cys Gly Arg Phe Lys Met Tyr 
305 310 



<210> 68 
<211> 4527 
<212> DNA 

<213> Caenorhabdit is elegans 
<400> 68 

atgcggctgc taattctcaa tatattattt gtggrttggc aaatacatgg cgtcatcgga 60 

cagagtacaa atgcagctgt ggtgtcaaca accgcggcgc cagttaaacc agccacttat 120 

atgcttgiicg ccccagcagt agtgcgaccc gatcaaccgt tctccgtttg catgaacctt 180 

ctgaaacaag ctacggatga agacatgatt gttcggattg aagtgagaac tgaacggaac 240 

gagacgactg ctgctcgagt tataagcaat ttgaagccgg gaatcgcaca gactgtcagc 300 

ttgagtgaga tgcctgctca atcactaact cctaggcaaa gctataagct cracatccgc 360 

ggggagactc ttaacgcaga actcatattt gaaaacgaaa atgagttaaa gtacgatcaa 420 

aaagcttT:at cagttttcat ccaaacagat cgagcaatct atcggcctgc ttctcttgtt 480 

cgataccgcg caattgtagt gaaatcggac ttgaaacctt acgtcggaaa cgctactatt 540 

aaaattrtcg atccaagcag aaatctgatt tcacagacga ttggtgtaac tttggataga 600 

ggagtataca gcggcgagct ccaacttgcc gaagagactc tccttggaga ttggtttatc 660 

gaggtggaaa cctcgaatgg agtccaggac aagtcgtcat ttacggtaga tacatacgtg 720 

cttcccaaat ttgaagtgaa tatcaagacg tcttcgttta taactattaa cgatgatttg 780 

tcagtgfccg tcgatgcaaa gtacacatat ggaaaaggcg ttgctggaaa ggccaaagta 840 

tctctagagc ttccatggca cagatggcat gcgatggttc ccactatcat tgacgaaaat 900 

ggagtcaaga aggaagagga gctgatggtc gagcgtactg taaagctgaa cagacaagga 960 

gaagcggcag ttgttttctc gaatgatgag ctcaagaggc acaagttgct acatgaatgg 1020 

ggaggacg::t caattcgaat tgttgcctca gtcactgaag acatcactga aatcgaaaga 1080 
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aacgctactc atcaaatttc aacgttccgr 
ggagacacct tcaagccagg actcacttat 
gacactcccg tgaaggcaac tttgccaaag 
ccatacaatc atgacacttc atctctgcaa 
gacgctcacg gcacctcggt tctcaccctc 
attgaggctc attatgatat tggtggcaag 
tcgctatatg tcgaggcagc cgtgagccca 
aatgagggag crgttgatgt aggaaagagt 
ctgtcgacaa ttacctatca ggccatgtcr 
actgttaatt ccgaacatgc gaccattagc 
tcacggttga ccgtctatgc cattattgag 
gactttaaag tagaaggaat attccaaaac 
gtcgagccag gacagaatgt gaagtttaag 
cttcttgtgg tagatcagag tgttctcttg 
aaagttgagc aagacttgga gaattatgac 
ccacggccat gggaggcaat tgatagaaag 
attggaggga gtgatgctca atcaatcttt 
gccctcttgt accgcgagcc tcaacgtgaa 
ccaggaatgg ctgaagccgc attcgccgct 
ccgacagttc gaaagttctt cccacacacc 
ggagaagttg agatggaaat tgaagctccg 
tttgcaatca atgaggaaaa tggtctcggt 
trtcgtccat tttttattca actcaatctt 
gcacttttgg tgcttgtttt caattatatg 
aagtatgata aagactctgg ttacgatctc 
gacgaggttg gccaacaaaa tgttcgtatt 
gcagtgtact tcccaattgt gccgtcttca 
attgcatccc aaggtggtga tgcagtagag 
aaagtagata gaaatattcc atttgtcatt 
aaaaatcttg agcttatctg gccaaacgac 
gatgtgattg gtgatatgat gggaccagtg 
ccatatggat gtggagaaca gaatatgttg 
Lacttaagag caacaaatag aaatgagagc 
gagcagggaa ttcagaggga gttgacgtac 
ggcgatagtg ataaagctgg ttcaacgtgg 
cacgccaaac aatacgcatt cgtggatcct 
aactctcaac aaatggagtc tggtgcattt 
atgcagggag gtgctcagga cgggggtgta 
ttggagaatg gaatggagaa tggaaaggcg 
gtatctggaa atgcgtatac aatggctgta 
aagcaagctg gaaaggcatt cgagaatttg 
gatgtgaagt ttgccagtgc tcagaaaaaa 
atgttccagg cccgtccagt agacattgaa 
gcccaaaatc aaacctcaga atctctctca 
gagctaggag gattcacttc cactcaagac 
tacgcagcag tcacttattc cgacaaacac 
catactcact catttgatat caacataaga 
tcttcattga atgatgcagt ttcaattaat 
ttgagttatt cttactacag ggactctttg 
gagatcaagg aaataagagc tggaaatcga 
cgtcctggaa agtctaacat ggctctagcc 
gatgcggaac aggtgcatac tttgacaagt 
aaggacgaca ctaagatgaa cgtatatttc 
i:cgttatact ctgatgtcac ttatcaagtt 
gtggattact acgatccgga ggagcagctc 
tcgctgcaag agaaatgtgg ggaagactgt 
tttgacgagt cgacagtaac aggcacgtcc 
atcattgctg tgctccttat tgcatga 



96 

gaggaggrca aacttgatgt tgaaaagcag 114G 
aatgttgtag ttgcactgaa acaaatggat 1200 
agagttcagg ratcaacatt ctacaactat 1260 
gaggaaaaag aaacaaaaat tgtggaagtt 1320 
caaccaccaa tcaactgcac ttctgctcgc 1380 
gacaacttca cagccacccc aatatattcc 1440 
acgaagagtt ncctgcagct tttggctgat 1500 
cttagcttct cgttgaaggc tacccagccg 1560 
cgttcgaaca ttgtggtgtc ncaacaaatg 1620 
ttcccagcaa cggctaacat ggccccaaag 1680 
agtagtcaag aagttcttgt cgacgcactt 1740 
caagtggcac tatcaattga caaacaagca 1800 
gtcacatctg acaagaactc ttttgttggg 1860 
ttaaaaactg gcaatgatat cactagagaa 1920 
tctaataatg ttggaggagg attcgggggt 1980 
aagaggtcca tttggagacc ttggtgggga 2040 
tctaatgctg ggcttgtagt tttgaccgac 2100 
tttatgtcag taatgatgat ggatggtgct 2160 
ccgccaatgg gcgggtcgtc gcctccaccg 2220 
tggatttggt cggatctcaa ctctactagt 2280 
gacaccatca cctcgcgggt cgcctccact 2340 
gtggctccaa cgacttcaaa acttcgcgtg 2400 
ccatatgctg tccgtcgtgg agaaaagttt 2460 
gaaaaggagc aagatgtaac agtgactctg 2520 
ctgaagaaag atggaactgt agtaaggcgt 2580 
gtgtccgttg ctggtggagg aacatcgaaa 2640 
attggagaga tcccggtaca tataagcgct 2700 
atgaacttga gagttgatcc acaaggatat 2760 
gacctcaaca ataactcttc ggatttctcc 2820 
gtggttgatg gatcgcagaa ggcaagatta 2880 
ttaaacaatg ctcataagct cgtacaaatg 2940 
aacttggttc caaatatttt ggtggtcaag 3000 
caattagaga caaaggctat caagtttatt 3060 
aaacgtgctg acaactcttt ctcagctttt 3120 
ctcaccgcat ttgtcgttcg atcgttccat 3180 
aatgttattt cccgagcagt cgcattcttg 3240 
gccgaacgtg gtgaagttca tcacaaggat 3300 
gctctaacgg catttgtgct aatctcgatt 3360 
gtgacgtatt tggaaaagca tttggatgaa 3420 
gtggcatatg cacttcaatt ggcaaaaagt 3480 
aagaaacata agatcgttga gaaaagtgga 3540 
gtggagaaac taaaagaatc gagagcgtat 3600 
actacctctt acgcagttct ttcttacctc 3660 
attattcgct ggttagtatc ccaacgcaac 3720 
actgtcatgg ctcttcaagc tctctcttcc 3780 
acaagtcaag taacaattct caacggaaaa 3840 
aacgcaattg ttctccaatc ctatcaacta 3900 
gccaatggaa ctggtgtggt cttcgctcag 3960 
aatgacgatg ctccgttctt ctgctcccag 4020 
ctgcaattgg atttgtgttg caactatacc 4080 
gaaattgatg ccttgagtgg ttaccgattc 4140 
attgaggatt tgcaaagggt ggagatggaa 4200 
aacccgctcg gtggccgtcc ggtatgtctc 4260 
gccgatcaaa aacctgccaa cttccgatta 4320 
aaaatgacat acgccgccaa gcaaacacga 4380 
tggccaccga tttccccatc actgcctcca 4440 
tctggttttg gtgccaagtg gtgtgctctt 4500 

4527 
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<210> 69 
<211> 4560 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 69 

atgcggctgc 

cagagtacaa 

atgcttgtcg 

ctgaaacaag 

gagacgattg 

ttgagtgaga 

ggggagactc 

aaagctttat 

cgataccggg 

aaaattttcg 

ggagtataca 

gaggtggaaa 

cttcccaaat 

tcagtgttcg 

tctctagagc 

ggagtcaaga 

gaagcggcag 

ggaggaggtt 

aacgctactc 

ggagacacct 

gacactcccg 

ccatacaatc 

gacgctcacg 

attgaggctc 

tcgctatatg 

aatgagggag 

ctgtcgacaa 

actgttaatt 

tcacggttga 

gactttaaag 

gtcgagccag 

cttcttgtgg 

aaagttgagc 

ccacggccat 

attggaggga 

gccctcttgt 

ggaggtctta 

gctccgccaa 

acctggattt 

ccggacacca 

ggtgtggctc 

cttccatatg 

atggaaaagg 

ctcctgaaga 

attgtgtccg 

tcaattggag 

gagatgaact 

attgacctca 

gacgtggttg 

gtgttaaaca 

ttgaacttgg 



taattctcaa 
atgcagctgt 
ccccagcagt 
ctacggatga 
ctgctcgagt 
tgcctgctca 
ttaacgcaga 
cagttttcat 
caattgtagt 
atccaagcag 
gcggcgagct 
cctcgaatgg 
ttgaagtgaa 
tcgatgcaaa 
t tccatggca 
aggaagagga 
ttgttttctc 
caattcgaat 
atcaaatttc 
tcaagccagg 
tgaaggcaac 
atgacacttc 
gcacctcggt 
attatgatat 
tcgaggcagc 
ctgttgatgt 
ttacctatca 
ccgaacatgc 
tcgtctatgc 
tagaaggaat 
gacagaatgt 
tagatcagag 
aagacttgga 
gggaggcaat 
gtgatgctca 
accgcgagcc 
cagtaatgat 
tgggcgggtc 
ggtcggatct 
tcacctcgtg 
caacgacttc 
ctgtccgtcg 
agcaagatgt 
aagatggaac 
ttgctggtgg 
agatcccggt 
tgagagttga 
acaataactc 
atggatcgca 
atgctcataa 
ttccaaatat 



tatattattt 
ggtgtcaaca 
agtgcgaccc 
agacatgatt 
tataagcaat 
atcactaact 
actcatattt 
ccaaacagat 
gaaatcggac 
aaatctgatt 
ccaacttgcc 
agtccaggac 
tatcaagacg 
gtacacatat 
cagatggcat 
gctgatggtc 
gaatgatgag 
tgttgcctca 
aacgttccgt 
actcacttat 
tttgccaaag 
atctctgcaa 
tctcaccctc 
tggtggcaag 
cgtgagccca 
aggaaagagt 
ggtcatgtct 
gaccattagc 
cattattgag 
attccaaaac 
gaagtttaag 
tgttctcttg 
gaattatgac 
tgatagaaag 
atcaatcttt 
tcaacgtgaa 
gatggatggt 
gtcgcctcca 
caactctact 
ggtcgcctcc 
aaaacttcgc 
tggagaaaag 
aacagtgact 
tgtagtaagg 
aggaacatcg 
acatataagc 
tccacaagga 
ttcggatttc 
gaaggcaaga 
gctcgtacaa 
tttggtggtc 



gtggtttggc 
accgcggcgc 
gatcaaccgt 
gttcggattg 
ttgaagccgg 
cctaggcaaa 
gaaaacgaaa 
cgagcaatct 
ttgaaacctt 
tcacagacga 
gaagagactc 
aagtcgtcat 
tcttcgttta 
ggaaaaggcg 
gcgatggttc 
gagcgtactg 
ctcaagaggc 
gtcactgaag 
gaggaggtca 
aatgttgtag 
agagttcagg 
gaggaaaaag 
caaccaccaa 
gacaacttca 
acgaagagtt 
cttagcttct 
cgttcgaaca 
ttcccagcaa 
agtagtcaag 
caagtggcac 
gtcacatctg 
ttaaaaactg 
tctaataatg 
aagaggtcca 
tctaatgctg 
tttatgtcag 
gctccaggaa 
ccgccgacag 
agtggagaag 
acttttgcaa 
gtgtttcgtc 
tttgcacttt 
ctgaagtatg 
cgtgacgagg 
aaagcagtgt 
gctattgcat 
tataaagtag 
tccaaaaatc 
ttagatgtga 
atgccatatg 
aagtacttaa 



aaatacatgg 
cagttaaacc 
tctccgtttg 
aagtgagaac 
gaatcgcaca 
gctataagct 
atgagttaaa 
atcggcctgc 
acgtcggaaa 
ttggtgtaac 
tccttggaga 
ttacggtaga 
taactattaa 
ttgctggaaa 
ccactatcat 
taaagctgaa 
acaagttgct 
acatcactga 
aacttgatgt 
ttgcactgaa 
tatcaacatt 
aaacaaaaat 
tcaactgcac 
cagccacccc 
tcctgcagct 
cgttgaaggc 
ttgtggtgtc 
cggctaacat 
aagttcttgt 
tatcaattga 
acaagaactc 
gcaatgatat 
ttggaggagg 
tttggagacc 
ggcttgtagt 
aaagaagact 
tggctgaagc 
ttcgaaagtt 
ttgagatgga 
tcaatgagga 
cattttttat 
tggtgcttgt 
ataaagactc 
ttggccaaca 
acttcccaat 
cccaaggtgg 
atagaaatat 
ttgagcttat 
ttggtgatat 
gatgtggaga 
gagcaacaaa 



cgtcatcgga 
agccacttat 
catgaacctt 
tgaacggaac 
gactgtcagc 
ctacatccgc 
gtacgatcaa 
ttctcttgtt 
cgctactatt 
tttggataga 
ttggtttatc 
tacatacgtg 
cgatgatttg 
ggccaaagta 
tgacgaaaat 
cagacaagga 
acatgaatgg 
aatcgaaaga 
tgaaaagcag 
acaaatggat 
ctacaactat 
tgtggaagtt 
ttctgctcgc 
aatatattcc 
tttggctgat 
tacccagccg 
tcaacaaatg 
ggccccaaag 
tgatgcactt 
caaacaagca 
ttttgttggg 
cactagagaa 
attcgggggt 
ttggtgggga 
tttgaccgac 
taatacaccc 
cgcattcgcc 
cttcccacac 
aattgaagct 
aaatggtctc 
tcaactcaat 
tttcaattat 
tggttacgat 
aaatgttcgt 
tgtgccgtct 
tgatgcagta 
tccatttgtc 
ctggccaaac 
gatgggacca 
acagaatatg 
tagaaatgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

27O0 

2760 

2820 

2880 

2940 

3000 

3060 
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agccaattag 
tacaaacgtg 
tggctcaccg 
cctaatgtta 
tttgccgaac 
gtagctctaa 
gcggtgacgt 
gtagtggcat 
ttgaagaaac 
aaagtggaga 
gaaactacct 
tcaattattc 
gacactgtca 
cacacaagtc 
agaaacgcaa 
aatgccaatg 
ttgaatgacg 
cgactgcaat 
gccgaaattg 
agtattgagg 
ttcaacccgc 
gttgccgatc 
ctcaaaatga 
tgttggccac 
tcctctggtt 



agacaaaggc 
ctgacaactc 
catttgtcgt 
tttcccgagc 
gtggtgaagt 
cggcatttgt 
atttggaaaa 
atgcacttca 
ataagatcgt 
aactaaaaga 
cttacgcagt 
gttggttagt 
tggctcttca 
aagtaacaat 
ttgttctcca 
gaactggtgt 
atgctccgtt 
tggatttgtg 
atgccttgag 
atttgcaaag 
tcggtggccg 
aaaaacctgc 
catacgccgc 
cgatttcccc 
ttggtgccaa 



tatcaagttt 
tttctcagct 
tcgatcgttc 
agtcgcattc 
tcatcacaag 
gctaatctcg 
gcatttggat 
attggcaaaa 
tgagaaaagt 
atcgagagcg 
tctttcttac 
atcccaacgc 
agctctctct 
tctcaacgga 
atcctatcaa 
ggtcttcgct 
cttctgctcc 
ttgcaactat 
tggttaccga 
ggtggagatg 
tccggtatgt 
caacttccga 
caagcaaaca 
atcactgcct 
gtggtgtgct 



98 

attgagcagg 
tttggcgata 
catcacgcca 
ttgaactctc 
gatatgcagg 
attttggaga 
gaagtatctg 
agtaagcaag 
ggagatgtga 
tatatgttcc 
ctcgcccaaa 
aacgagctag 
tcctacgcag 
aaacatactc 
ctatcttcat 
cagttgagtt 
caggagatca 
acccgtcctg 
ttcgatgcgg 
gaaaaggacg 
ctctcgttat 
ttagtggatt 
cgatcgctgc 
ccatttgacg 
cttatcattg 



gaattcagag 
gtgataaagc 
aacaatacgc 
aacaaatgga 
gaggtgctca 
a tggaatgga 
gaaatgcgta 
ctggaaaggc 
agtttgccag 
aggcccgtcc 
atcaaacctc 
gaggattcac 
cagtcactta 
actcatttga 
tgaatgatgc 
attcttacta 
aggaaataag 
gaaagtctaa 
aacaggtgca 
acactaagat 
actctgatgt 
actacgatcc 
aagagaaatg 
agtcgacagt 
ctgtgctcct 



ggagt tgacg 
tggttcaacg 
att cgtggat 
gtctggtgca 
ggacgggggt 
gaatggaaag 
tacaatggct 
att cgagaa t 
tgctcagaaa 
agtagacat t 
agaatctctc 
ttccactcaa 
ttccgacaaa 
tatcaacata 
agtttcaatt 
cagggactct 
agctggaaat 
catggctcta 
tactttgaca 
gaacgtatat 
cacttat caa 
ggaggagcag 
tggggaagac 
aacaggcacg 
tattgcatga 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 



<210> 70 
<211> 1508 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 70 

Met Arg Leu Leu lie Leu Asn lie Leu Phe Val Val Trp Gin lie His 
15 10 15 



Gly Val lie Gly Gin Ser Thr Asn Ala Ala Val Val Ser Thr Thr Ala 



20 



25 



30 



Ala Pro Val Lys Pro Ala Thr Tyr Met Leu Val Ala Pro Ala Val Val 



35 



40 



45 



Arg Pro Asp Gin Pro Phe Ser Val Cys Met Asn Leu Leu Lys Gin Ala 



50 



55 



60 



Thr Asp Glu Asp Met lie Val Arg lie Glu Val Arg Thr Glu Arg Asn 
65 70 75 80 



Glu Thr lie Ala Ala Arg Val lie Ser Asn Leu Lys Pro Gly lie Ala 



85 



90 



95 



Gin Thr Val Ser Leu Ser Glu Met Pro Ala Gin Ser Leu Thr Pro Arg 



100 



105 



110 



Gin Ser Tyr Lys Leu Tyr lie Arg Gly Glu Thr Leu Asn Ala Glu Leu 



115 



120 



125 



lie Phe Glu Asn Glu Asn Glu Leu Lys Tyr Asp Gin Lys Ala Leu Ser 



130 



135 



140 
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Y^]_ pv^^^ lis Gin Thi;" 7\3p Airg Alci lis Tyr Arg Pre Alci Scr Lsu V5.I 
145 150 155 160 

Arg Tyr Arg Ala lie Val Val Lys Ser Asp Leu Lys Pro Tyr Val Giy 

165 170 175 

Asn Ala Thr lie Lys lie Phe Asp Pro Ser Arg Asn Leu lie Ser Gin 
180 185 190 

Thr lie Giy Val Thr Leu Asp Arg Giy Val Tyr Ser Giy Giu Leu Gin 
195 200 205 

Leu Ala Glu Giu Thr Leu Leu Giy Asp Trp Phe lie Giu Val Giu Thr 
210 215 220 

Ser Asn Giy Val Gin Asp Lys Ser Ser Phe Thr Val Asp Thr Tyr Val 
225 230 235 240 

Leu Pro Lys Phe Giu Val Asn lie Lys Thr Ser Ser Phe lie Thr lie 

245 250 255 

Asn Asp Asp Leu Ser Val Phe Val Asp Ala Lys Tyr Thr Tyr Giy Lys 
260 265 270 

Giy Val Ala Giy Lys Ala Lys Val Ser Leu Giu Leu Pro Trp His Arg 
275 280 285 

Trp His Ala Met Val Pro Thr lie lie Asp Giu Asn Giy Val Lys Lys 
290 295 300 

Glu Giu Glu Leu Met Val Giu Arg Thr Val Lys Leu Asn Arg Gin Giy 
305 310 315 320 

Giu Ala Ala Val Val Phe Ser Asn Asp Giu Leu Lys Arg His Lys Leu 

325 330 335 

Leu His Giu Trp Giy Giy Giy Ser lie Arg lie Val Ala Ser Val Thr 
340 345 350 

Giu Asp lie Thr Glu lie Giu Arg Asn Ala Thr His Gin lie Ser Thr 
355 360 365 

Phe Arg Giu Glu Val Lys Leu Asp Val Giu Lys Gin Giy Asp Thr Phe 
370 375 380 

Lys Pro Giy Leu Thr Tyr Asn Val Val Val Ala Leu Lys Gin Met Asp 
385 390 395 400 

■ Asp Thr Pro Vai Lys Ala Thr Leu Pro Lys Arg Val Gin Val Ser Thr 

405 410 415 

Phe Tyr Asn Tyr Pro Tyr Asn His Asp Thr Ser Ser Leu Gin Giu Glu 
420 425 430 

Lys Glu Thr Lys lie Vai Glu Vai Asp Ala His Giy Thr Ser Vai Leu 
435 440 445 
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Thr Leu Gin Pro Pro 
450 

Tvr Asp lie Glv Glv 
465 

Ser Leu Tyr Vai Giu 

485 

Leu Leu Ala Asp Asn 
500 

Phe Ser Leu Lys Ala 
515 

Met Ser Arg Ser Asn 
530 

Giu His Ala Thr lie 
545 

Ser Arg Leu lie Val 

565 

Val Asp Ala Leu Asp 
5S0 

Aia Leu Ser lie Asp 
595 

Phe Lys Val Thr Ser 
610 

Asp Gin Ser Vai Leu 
625 

Lys Val Giu Gin Asp 

645 

Giy Phe Gly Gly Pro 

660 

Ser lie Trp Arg Pro 
675 

lie Phe Ser Asn Ala 
690 

Arg Giu Pro Gin Arg 
705 

Pro Gly Met Ala Giu 
725 

Ser Pro Pro Pro Pro 
740 

Trp Ser Asp Leu Asn 



100 

lie Asn Cys Thr Ser Ala 
455 

Lvs Asp Asn Phe Thr Ala 
470 475 

Ala Ala Val Ser Pro Thr 

490 

Giu Gly Ala Val Asp Val 
505 

Thr Gin Pro Leu Ser Thr 
520 

lie Val Vai Ser Gin Gin 
535 

Ser Phe Pro Ala Thr Aia 
550 555 

Tyr Aia lie lie Giu Ser 

570 

Phe Lys Val Giu Giy lie 
585 

Lvs Gin Ala Val Giu Pro 
600 

Asp Lys Asn Ser Phe Vai 
615 

Leu Leu Lys Thr Giy Asn 
630 635 

Leu Giu Asn Tyr Asp Ser 

650 

Arg Pro Trp Giu Aia lie 
665 

Trp Trp Giy lie Gly Giy 
680 

Gly Leu Val Vai Leu Thr 
695 

Giu Phe Met Ser Val Met 
710 715 

Ala Aia Phe Aia Ala Pro 

730 

Thr Val Arg Lys Phe Phe 
745 

Ser Thr Ser Giy Giu Val 



Arg lie Giu Ala His 
460 

Thr Pro lie Tyr Ser 

480 

Lys Ser Phe Leu Gin 
495 

Giy Lys Ser Leu Ser 
510 

lie Thr Tyr Gin Val 
525 

Met Thr Vai Asn Ser 
540 

Asn Met Ala Pro Lys 

560 

Ser Gin Giu Vai Leu 
575 

Phe Gin Asn Gin Val 
590 

Gly Gin Asn Vai Lys 
605 

Giy Leu Leu Vai Val 
620 

Asp lie Thr Arg Giu 

640 

Asn Asn Vai Glv Gly 
655 

Asp Arg Lys Lys Arg 
670 

Ser Asp Aia Gin Ser 
685 

Asp Aia Leu Leu Tyr 
700 

Met Met Asp Giy Ala 

720 

Pro Met Giy Gly Ser 
735 

Pro His Thr Trp lie 
750 

Giu Met Giu lie Giu 
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755 760 765 

Ala Pro Asp Thr lie Thr Ser Trp Val Ala Ser Thr Phe Ala lie Asn 
770 775 780 

Glu Glu Asn Gly Leu Gly Val Ala Pro Thr Thr Ser Lys Leu Arg Val 
785 790 795 800 

Phe Arg Pro Phe Phe lie Gin Leu Asn Leu Pro Tyr Ala Val Arg Arg 

805 810 815 

Gly Glu Lys Phe Ala Leu Leu Val Leu Val Phe Asn Tyr Met Glu Lys 
820 825 830 

Glu Gin Asp Val Thr Val Thr Leu Lys Tyr Asp Lys Asp Ser Gly Tyr 
835 840 845 

Asp Leu Leu Lys Lys Asp Gly Thr Val Val Arg Arg Asp Glu Val Gly 
850 855 860 

Gin Gin Asn Val Arg lie Val Ser Val Ala Gly Gly Gly Thr Ser Lys 
865 870 875 880 

Ala Val Tyr Phe Pro lie Val Pro Ser Ser lie Gly Glu lie Pro Val 

885 890 895 

His lie Ser Ala lie Ala Ser Gin Gly Gly Asp Ala Val Glu Met Asn 
900 905 910 

Leu Arg Val Asp Pro Gin Gly Tyr Lys Val Asp Arg Asn lie Pro Phe 
915 920 925 

Val lie Asp Leu Asn Asn Asn Ser Ser Asp Phe Ser Lys Asn Leu Glu 
930 935 940 

Leu lie Trp Pro Asn Asp Val Val Asp Gly Ser Gin Lys Ala Arg Leu 
945 950 955 960 

Asp Val lie Gly Asp Met Met Gly Pro Val Leu Asn Asn Ala His Lys 

965 970 975 

Leu Val Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Leu Asn Leu 
980 985 990 

Val Pro Asn lie Leu Val Val Lys Tyr Leu Arg Ala Thr Asn Arg Asn 
995 1000 1005 

Glu Ser Gin Leu Glu Thr Lys Ala lie Lys Phe lie Glu Gin Gly lie 
1010 1015 1020 

Gin Arg Glu Leu Thr Tyr Lys Arg Ala Asp Asn Ser Phe Ser Ala Phe 
1025 1030 1035 1040 

Gly Asp Ser Asp Lys Ala Gly Ser Thr Trp Leu Thr Ala Phe Val Val 
1045 1050 1055 



Arg Ser Phe His His Ala Lys Gin Tyr Ala Phe Val Asp Pro Asn Val 
1060 1065 1070 
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lie Ser Arg Ala Val Ala Phe Leu Asn Ser Gin Gin Met Glu Ser Gly 
1075 1080 1085 

Ala Phe Ala Glu Arg Gly Glu Val His His Lys Asp Met Gin Gly Gly 
1090 1095 1100 

Ala Gin Asp Gly Gly Val Ala Leu Thr Ala Phe Val Leu lie Ser lie 
1105 1110 1115 1120 

Leu Glu Asn Gly Met Glu Asn Gly Lys Ala Val Thr Tyr Leu Glu Lys 
1125 1130 1135 

His Leu Asp Glu Val Ser Gly Asn Ala Tyr Thr Met Ala Val Val Ala 
1140 1145 1150 

Tyr Ala Leu Gin Leu Ala Lys Ser Lys Gin Ala Gly Lys Ala Phe Glu 
1155 1160 1165 

Asn Leu Lys Lys His Lys lie Val Glu Lys Ser Gly Asp Val Lys Phe 
1170 1175 1180 

Ala Ser Ala Gin Lys Lys Val Glu Lys Leu Lys Glu Ser Arg Ala Tyr 
1185 1190 1195 1200 

Met Phe Gin Ala Arg Pro Val Asp lie Glu Thr Thr Ser Tyr Ala Val 
1205 1210 1215 

Leu Ser Tyr Leu Ala Gin Asn Gin Thr Ser Glu Ser Leu Ser lie lie 
1220 1225 1230 

Arg Trp Leu Val Ser Gin Arg Asn Glu Leu Gly Gly Phe Thr Ser Thr 
1235 1240 1245 

Gin Asp Thr Val Met Ala Leu Gin Ala Leu Ser Ser Tyr Ala Ala Val 
1250 1255 1260 

Thr Tyr Ser Asp Lys His Thr Ser Gin Val Thr lie Leu Asn Gly Lys 
1265 1270 1275 1280 

His Thr His Ser Phe Asp lie Asn lie Arg Asn Ala lie Val Leu Gin 
1285 1290 1295 

Ser Tyr Gin Leu Ser Ser Leu Asn Asp Ala Val Ser lie Asn Ala Asn 
1300 1305 1310 

Gly Thr Gly Val Val Phe Ala Gin Leu Ser Tyr Ser Tyr Tyr Arg Asp 
1315 1320 1325 

Ser Leu Asn Asp Asp Ala Pro Phe Phe Cvs Ser Gin Glu lie Lys Glu 
1330 1335 ' 1340 

lie Arg Ala Gly Asn Arg Leu Gin Leu Asp Leu Cys Cys Asn Tyr Thr 
1345 1350 1355 1360 



Arg Pro Gly Lys Ser Asn Met Ala Leu Ala Glu lie Asp Ala Leu Ser 
1365 1370 1375 
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Gly Tyr Arg Phe Asp Ala Glu Gin Val His Thr Leu Thr Ser lie Giu 



Asp Leu Gin Arg Val Glu Met Glu Lys Asp Asp Thr Lvs Met Asn Vai 
1395 1400 ' 1405 

Tyr Phe Asn Pro Leu Gly Gly Arg Pro Val Cys Leu Ser Leu Tyr Ser 
141C 1415 ' 1420 

Asp Val Thr Tyr Gin Val Ala Asp Gin Lys Pro Ala Asn Phe Arg Leu 
1425 1430 1435 1440 

Val Asp Tyr Tyr Asp Pro Glu Glu Gin Leu Lys Met Thr Tyr Ala Ala 
1445 1450 1455 

Lys Gin Thr A.rg Ser Leu Gin Glu Lys Cys Gly Glu Asp Cys Trp Pro 
1460 1465 1470 

Pro lie Ser Pro Ser Leu Pro Pro Phe Asp Glu Ser Thr Val Thr Gly 
1475 1480 1485 

Thr Ser Ser Gly Phe Glv Ala Lys Trp Cys Ala Leu lie lie Ala Val 
1490 ' 1495 1500 

Leu Leu He Ala 
1505 



<210> 71 
<211> 1519 
<212> PRT 

<213> Caenorhabdir is elegans 
<400> 71 

Met Arg Leu Leu He Leu Asn He Leu Phe Val Val Trp Gin He His 
15 10 15 

Gly Val lie Gly Gin Ser Thr Asn Ala Ala Val Val Ser Thr Thr Ala 
20 25 30 

Ala Pro Val Lys Pro Ala Thr Tyr Met Leu Val Ala Pro Ala Val Val 
35 ^ 40 45 

Arg Pro Asp Gin Pro Phe Ser Vai Cys Met Asn Leu Leu Lys Gin Ala 
50 55 60 

Thr Asp Glu Asp Met He Val Arg He Giu Val Arg Thr Giu Arg Asn 
65 70 75 80 

Giu Thr He Ala Ala Arg Val He Ser Asn Leu Lys Pro Gly He Ala 

85 90 95 

Gin Thr Val Ser Leu Ser Glu Met Pro Ala Gin Ser Leu Thr Pro Arg 
100 105 110 



Gin Ser Tyr Lys Leu Tyr He Arg Gly Giu Thr Leu Asn Ala Glu Leu 
115 120 125 
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lie Phe Glu Asn Glu Asn Glu Leu Lvs Tyr Asd Gin Lys Ala Leu Ser 
130 135 140 

Val Phe He Gin Thr Asp Arg Ala He Tyr Arg Pro Ala Ser Leu Val 

150 155 160 

Arg Tyr Arg Ala He Val Val Lys Ser Asp Leu Lys Pro Tyr Val Gly 

165 170 175 

.•■.-r; Ala Thr He Lys He Phe Asp Pro Ser Arg Asn Leu He Ser Gin 
180 185 190 

ir.: Gly Val Thr Leu Asp Arg Gly Val Tyr Ser Gly Glu Leu Gin 

195 200 205 

Aid: Glu Glu Thr Leu Leu Gly Asp Trp Phe He Glu Val Glu Thr 

215 220 

r Asn Gly Val Gin Asp Lys Ser Ser Phe Thr Val Asp Thr Tyr Val 
— ' 230 235 240 

J P:: Lys Phe Glu Val Asn He Lys Thr Ser Ser Phe He Thr He 
245 250 255 



Asp Leu Ser Val Phe Val Asp Ala Lys Tyr Thr Tyr Gly Lys 
260 265 270 

Gly Val Ala Gly Lys Ala Lys Val Ser Leu Glu Leu Pro Trp His Arg 
275 280 285 

Trp Hi-. Ala Met Val Pro Thr He He Asp Glu Asn Gly Val Lys Lys 
290 295 300 

Glu Glu Glu Leu Met Val Glu Arg Thr Val Lys Leu Asn Arg Gin Gly 
305 310 315 320 

Glu Ala Ala Val Val Phe Ser Asn Asp Glu Leu Lys Arg His Lys Leu 

325 330 335 

Leu His Glu Trp Gly Gly Gly Ser He Arg lie Val Ala Ser Val Thr 
340 345 350 

Glu Asp He Thr Glu He Glu Arg Asn Ala Thr His Gin He Ser Thr 
355 360 365 

Phe Arg Glu Glu Val Lys Leu Asp Val Glu Lys Gin Gly Asp Thr Phe 
370 375 380 

Lys Pro Gly Leu Thr Tyr Asn Val Val Val Ala Leu Lys Gin Met Asp 
3S5 390 395 400 

Asp Thr Pro Val Lys Ala Thr Leu Pro Lys Arg Val Gin Val Ser Thr 

405 410 415 

Phe Tyr Asn Tyr Pro Tyr Asn His Asp Thr Ser Ser Leu Gin Glu Glu 
420 425 430 

Lys Glu Thr Lys He Val Glu Val Asp Ala His Gly Thr Ser Val Leu 
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435 440 445 

Thr Leu Gin Pro Pro lie Asn Cys Thr Ser Ala Arg lie Glu Ala His 
450 455 460 

Tyr Asp lie Gly Gly Lys Asp Asn Phe Thr Ala Thr Pro lie Tyr Ser 
465 470 475 480 

Ser Leu Tyr Vai Glu Ala Ala Vai Ser Pro Thr Lys Ser Phe Leu Gin 

485 490 495 

Leu Leu Ala Asp . Asn Glu Gly Ala Vai Asp Vai Gly Lys Ser Leu Ser 
500 505 510 

Phe Ser Leu Lys Ala Thr Gin Pro Leu Ser Thr lie Thr Tyr Gin Vai 
515 520 525 

Met Ser Arg Ser Asn lie Vai Vai Ser Gin Gin Met Thr Vai Asn Ser 
530 535 540 

Glu His Ala Thr lie Ser Phe Pro Ala Thr Ala Asn Met Aia Pro Lys 
545 550 555 560 

Ser Arg Leu lie Vai Tyr Aia lie lie Giu Ser Ser Gin Giu Vai Leu 

565 570 575 

Vai Asp Aia Leu Asp Phe Lys Vai Glu Giy lie Phe Gin Asn Gin Vai 
580 585 590 

Aia Leu Ser lie Asp Lys Gin Ala Vai Giu Pro Gly Gin Asn Vai Lys 
595 600 605 

Phe Lys Vai Thr Ser Asp Lys Asn Ser Phe Vai Gly Leu Leu Vai Vai 
610 615 620 

Asp Gin Ser Vai Leu Leu Leu Lys Thr Giy Asn Asp lie Thr Arg Giu 
625 630 635 640 

Lys Vai Giu Gin Asp Leu Giu Asn Tyr Asp Ser Asn Asn Vai Gly Gly 

645 650 655 

Giy Phe Gly Gly Pro Arg Pro Trp Glu Ala lie Asp Arg Lys Lys Arg 
660 665 670 

Ser lie Trp Arg Pro Trp Trp Giy lie Gly Giy Ser Asp Aia Gin Ser 
675 680 685 

lie Phe Ser Asn Aia Gly Leu Vai Vai Leu Thr Asp Aia Leu Leu Tyr 
690 695 700 

Arg Glu Pro Gin Arg Glu Phe Met Ser Glu Arg Arg Leu Asn Thr Pro 
705 710 715 720 

Gly Gly Leu Thr Vai Met Met Met Asp Gly Aia Pro Gly Met Ala Glu 
725 730 735 

Ala Aia Phe Aia Aia Pro Pro Met Giy Giy Ser Ser Pro Pro Pro Pro 
740 745 750 
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Thr Vai Arg Lys Phe Phe Pro His Thr Trp lie Trp Ser Asp Leu Asn 
755 760 765 

Ser Thr Ser Gly Glu Val Glu Met Glu lie Glu Ala Pro Asp Thr lie 
770 775 780 

Thr Ser Trp Val Ala Ser Thr Phe Ala He Asn Glu Glu Asn Glv Leu 
785 790 795 ' 800 

Gly Vai Ala Pro Thr Thr Ser Lys Leu Arg Val Phe Arg Pro Phe Phe 

805 810 815 

He Gin Leu Asn Leu Pro Tyr Ala Val Arg Arg Gly Glu Lys Phe Ala 
820 825 830 

Leu Leu Vai Leu Val Phe Asn Tyr Met Glu Lys Glu Gin Asp Vai Thr 
835 840 845 

Vai Thr Leu Lys Tyr Asp Lys Asp Ser Gly Tyr Asp Leu Leu Lys Lys 
85C 855 860 

Asp Gly Thr Vai Val Arg Arg Asp Glu Val Gly Gin Gin Asn Vai Arg 
865 870 875 880 

He Val Ser Val Ala Gly Gly Gly Thr Ser Lys Ala Val Tyr Phe Pro 
885 890 895 

He Val Pro Ser Ser He Gly Glu He Pro Val His He Ser Ala He 
900 905 910 

Ala Ser Gin Gly Gly Asp Ala Vai Glu Met Asn Leu Arg Val Asp Pro 
915 920 925 

Gin Gly Tyr Lys Val Asp Arg Asn He Pro Phe Val He Asp Leu Asn 
930 935 940 

Asn Asn Ser Ser Asp Phe Ser Lys Asn Leu Glu Leu He Trp Pro Asn 
945 950 955 960 

Asp Vai Vai Asp Gly Ser Gin Lys Ala Arg Leu Asp Val He Gly Asp 

965 970 975 

Met Men Gly Pro Val Leu Asn Asn Ala His Lys Leu Val Gin Met Pro 
980 985 990 

Tyr Gly Cys Gly Glu Gin Asn Met Leu Asn Leu Val Pro Asn He Leu 
995 1000 1005 

Val Vai Lys Tyr Leu Arg Ala Thr Asn Arg Asn Glu Ser Gin Leu Glu 
1010 1015 1020 

Thr Lys Ala He Lys Phe He Glu Gin Gly He Gin Arg Glu Leu Thr 
1025 1030 1035 1040 



Tyr Lys Arg Ala Asp Asn Ser Phe Ser Ala Phe Gly Asp Ser Asp Lys 
1045 1050 1055 
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Ala Gly Ser Thr Trp Leu Thr Ala Phe Val Val Arg Ser Phe His His 

-1 ^ rs -I /~\ ^ I- ~i ^ 

XVJOU XU03 XU/U 

Aia Lys Gin Tyr Ala Phe Val Asp Pro Asn Val lie Ser Arg Ala Val 
1075 1080 1085 

Ala Pne Leu Asn Ser Gin Gin Met Glu Ser Gly Ala Phe Ala Glu Arg 
1090 1095 1100 

Gly GLu Val His His Lys Asp Met Gin Gly Gly Ala Gin Asp Gly Gly 

:::5 mo 1115 1120 

V^: Aic. Leu Thr Ala Phe Val Leu lie Ser lie Leu Glu Asn Gly Met 
1125 1130 1135 

Ai^:. Gly Lys Ala Val Thr Tyr Leu Glu Lys His Leu Asp Glu Val 
1140 1145 " 1150 

Gly Asn Ala Tyr Thr Met Ala Val Val Ala Tyr Ala Leu Gin Leu 
:i55 1160 1165 

a:-. I.yr Ser Lys Gin Ala Gly Lys Ala Phe Glu Asn Leu Lys Lys His 

1175 1180 

Lys lie Val Glu Lys Ser Gly Asp Val Lys Phe Aia Ser Ala Gin Lys 
1165 1190 1195 1200 

Lys Val Glu Lys Leu Lys Glu Ser Arg Ala Tyr Met Phe Gin Ala Arg 
1205 1210 1215 

Pro Val Asp lie Glu Thr Thr Ser Tyr Ala Val Leu Ser Tyr Leu Ala 
1220 1225 1230 

Gin Asn Gin Thr Ser Glu Ser Leu Ser lie lie Arg Trp Leu Val Ser 
1235 1240 1245 

Gin Arc Asn Glu Leu Gly Gly Phe Thr Ser Thr Gin Aso Thr Val Met 
125C 1255 1260 

Ala Leu Gin Ala Leu Ser Ser Tyr Ala Ala Val Thr Tyr Ser Asp Lys 
1265 1270 1275 1280 

His Thr Ser Gin Val Thr lie Leu Asn Gly Lys His Thr His Ser Phe 
1285 1290 1295 

Asp lie Asn lie Arg Asn Ala lie Val Leu Gin Ser Tyr Gin Leu Ser 
1300 1305 1310 

Ser Leu Asn Asp Ala Val Ser lie Asn Ala Asn Gly Thr Gly Val Val 
1315 1320 1325 

Phe Ala Gin Leu Ser Tyr Ser Tyr Tyr Arg Asp Ser Leu Asn Asp Asp 
1330 1335 1340 

Ala Pro Phe Phe Cys Ser Gin Glu lie Lys Glu lie Arg Ala Gly Asn 
1345 1350 1355 1360 



Arg Leu Gin Leu Asp Leu Cys Cys Asn Tyr Thr Arg Pro Gly Lys Ser 
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1365 1370 1375 

Asn Met Ala Leu Ala Glu lie Asp Ala Leu Ser Gly Tyr Arg Phe Asp 
1380 1385 1390 

Ala Glu Gin Val His Thr Leu Thr Ser lie Glu Asp Leu Gin Arg Val 
1395 1400 1405 

Glu Met Glu Lys Asp Asp Thr Lys Met Asn Val Tyr Phe Asn Pro Leu 
1410 1415 1420 

Gly Gly Arg Pro Val Cys Leu Ser Leu Tyr Ser Asp Val Thr Tyr Gin 
1425 1430 1435 1440 

Val Ala Asp Gin Lys Pro Ala Asn Phe Arg Leu Val Asp Tyr Tyr Asp 
1445 1450 1455 

Pro Glu Glu Gin Leu Lys Met Thr Tyr Ala Ala Lys Gin Thr Arg Ser 
1460 1465 1470 

Leu Gin Glu Lys Cys Gly Glu Asp Cys Trp Pro Pro lie Ser Pro Ser 
1475 1480 1485 

Leu Pro Pro Phe Asp Glu Ser Thr Val Thr Gly Thr Ser Ser Gly Phe 
1490 1495 1500 

Gly Ala Lys Trp Cys Ala Leu lie lie Ala Val Leu Leu lie Ala 
1505 1510 1515 



<210> 72 
<211> 1026 
<212> DNA 

<213> Caenorhabdi tis elegans 
<400> 72 

atgaatattg tgtgggttat aattttttgg aagctacaga aaggcatttt ccgagaagat 60 

ggacttgaac cagtaacact agcagttcat ggtactgcag aaatgatgga aatgatagaa 120 

aataagcctg agaactggga tgggcctata tcatttggat tgtttattga ttttcattct 180 

agacaaattc tggattacgt cgctaaggtt tatagctgcg atgaggagtt tcagaaaaag 240 

gttaccgtac actttgcatt ccgtctatca ccctttcaaa ctagctgccc acaaatcaaa 300 

gtctcaccgt caactctcga atgtggcgag ttcctctcca acagaaaaaa gtttcgacgt 360 

gctgtaggcg actcatttca attgtaccca agtaacttga tgagaaatat tgcaagaaaa 420 

ggtgccaaat cggatattca tttcattgtc gatggtgata tgataatgag tgatggattt 480 

gcggaaaaga ttaaaccaat rgcaaatcaa attgttgatg ggaaaaacaa aaatgtattg 540 

gttgttagaa gatttgaaac aaatgaaacg acaatacccc acaaccatat agaattgaaa 600 

aatgcaattg aaaataaaca agtcttccaa tttcatcaca gatttttctt tgctgggcat 660 

aaaatttcaa atatttcaca ctggtttgcg gtttccaatg aaaccgacga aattactgct 720 

tgggaaatac cttactcgag cagcttatgg gaagttcaag tgattcttca tcgtaacgac 780 

rtgtacaatg cggattactt tccagcgaga arcaaggtta tgcagtcatt ggtgtacagc 840 

ctgtgcagag ccaactatac gttcaacctc ctatcacacg ttttcaacgt gcacaaagga 900 

atcaaattag gcgatactaa cttctcaaaa tccgttatag cacactctaa gagaaatgga 960 

agaaatagcg aactccagga tacataccca gatacgttgg atagatgcgg tcaatttgta 1020 
atgtaa 1026 

<210> 73 
<211> 341 
<212> PRT 
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<213> Caenorhabdit is elegans 
<400> 73 

Met Asn lie Vai Trp Val lie lie Phe Trp Lys Leu Gin Lvs Gly lie 
1 5 10 ' 15 

Phe Arg Glu Asp Gly Leu Glu Pro Val Thr Leu Ala Val His Gly Thr 
20 25 30 

Ala Glu Met Met Glu Met lie Glu Asn Lys Pro Glu Asn Trp Asp Gly 
35 40 45 

Pro lie Ser Phe Gly Leu Phe lie Asp Phe His Ser Arg Gin lie Leu 
50 55 60 

Asp Tyr Val Ala Lys Val Tyr Ser Cys Asp Glu Glu Phe Gin Lys Lys 
65 70 75 80 

Val Thr Val His Phe Ala Phe Arg Leu Ser Pro Phe Gin Thr Ser Cys 

85 90 95 

Pro Gin lie Lys Val Ser Pro Ser Thr Leu Glu Cys Gly Glu Phe Leu 
100 105 110 

Ser Asn Arg Lys Lys Phe Arg Arg Ala Val Gly Asp Ser Phe Gin Leu 
115 120 125 

Tyr Pro Ser Asn Leu Met Arg Asn lie Ala Arg Lys Gly Ala Lys Ser 
130 135 140 

Asp lie His Phe He Val Asp Gly Asp Met He Met Ser Asp Gly Phe 
145 150 155 160 

Ala Glu Lys He Lys Pro He Ala Asn Gin He Val Asp Gly Lys Asn 

165 170 175 

Lys Asn Val Leu Val Val Arg Arg Phe Glu Thr Asn Glu Thr Thr He 
180 185 190 

Pro His Asn His He Glu Leu Lys Asn Ala He Glu Asn Lys Gin Val 
195 200 205 

Phe Gin Phe His His Arg Phe Phe Phe Ala Gly His Lys He Ser Asn 
210 215 220 

He Ser His Trp Phe Ala Val Ser Asn Glu Thr Asp Glu He Thr Ala 
225 230 235 240 

Trp Glu He Pro Tyr Ser Ser Ser Leu Trp Glu Val Gin Val He Leu 

245 250 255 

His Arg Asn Asp Leu Tyr Asn Ala Asp Tyr Phe Pro Ala Arg He Lys 
260 265 270 

Val Met Gin Ser Leu Val Tyr Ser Leu Cys Arg Ala Asn Tyr Thr Phe 
275 280 285 



Asn Leu Leu Ser His Val Phe Asn Val His Lys Gly He Lys Leu Gly 
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Asp Thr Asn Phe 
305 

Arg Asn Ser Giu 

Giy Gin Phe Val 
340 



295 

Ser Lys Ser Val 

3io 

Leu Gin Asd Thr 
325 

Met 



110 

300 

lie Ala His Ser 

315 

Tyr Pro Asp Thr 
330 



Lys Arg Asn Gly 
320 

Leu Asp Arg Cys 
335 



<210> 74 
<211> 1869 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 74 

atgcagtaca 

ggggcgtttc 

acacgtctta 

cgcctacatc 

tggaatattt 

gcttggattc 

gaaatcatcg 

accaatattt 

atggtggaaa 

gcattgggac 

aatggatggg 

actgccatgt 

atacaaattc 

tgtccggaga 

tacgcatttt 

cgacgtagac 

ttagacaatt 

tgcatggtga 

tggcggcatc 

aaagccgtca 

actgaatcct 

gaacacgcaa 

tacggtacaa 

gcattggaaa 

gatttggtga 

ccaagttctc 

gtagcacaaa 

gaatgccctt 

atgcaactga 

caaaatcata 

tgccttcaca 

ctaagatga 



ttgtagcgtc 
cggaagaaga 
tgttcatgca 
tgataactga 
caaactgcga 
caaattcaca 
gcaatgacat 
ttgatttgtg 
atctttccga 
gtggatttaa 
caagcaaatg 
cagatcaaga 
catgtgctta 
ctccacttgc 
ttgacaagat 
gatcttttaa 
tccgaattat 
cgcagttttc 
ctatatctac 
cagaattgaa 
ggatgctaga 
actgcaaata 
taatcgatca 
tgacataccc 
tagaacattt 
acgtacctac 
atgtaaatta 
tttacgatca 
aaatgatgaa 
atgcttcaaa 
cgttaaaaaa 



atattttaca 
ttacattcgc 
gcattttaaa 
tgaaaatcac 
gtggtttttt 
ttattcaaaa 
tggaaagatt 
gaaacaattt 
ttggtatctt 
cactggaata 
gagagtagtg 
catatttaat 
caattatcaa 
gctacatttc 
tcgaaaagcg 
aggaaataat 
cccaaatgca 
caaagatcga 
agcagtatac 
tcgcacagat 
crctttatat 
tattttaatg 
aactggaaac 
gcagctgagg 
attgaacaaa 
aaatatttca 
tgaaaagaac 
gcggttcgga 
ttataagttc 
atcattgaaa 
taaatttatg 



atatggaact 
cttgcgtata 
agtgtattga 
cgatccgata 
cacaatctca 
tattatggtt 
atgttcatgg 
agaaacttta 
aacaaggatg 
atcatgtttg 
gctaacaaat 
gcatacattc 
cttggagcac 
aactcacaaa 
tttgatgaaa 
caaaaagata 
attggaagaa 
cttaatcatt 
gggaaggata 
atcacaatcc 
cccataaatt 
actgatgtag 
ctgaaacaaa 
ttaaatttat 
accattcaaa 
aaatggataa 
tatgaacctt 
ggatttggct 
ttggtttctc 
cgctggagac 
aaaaaaactg 



ttgtggacca 
ttatcggtgg 
agtactctga 
ttcatgagct 
cagagttcga 
tgagcaaact 
atgttgatat 
acaattccca 
gtaaaaagtc 
atttggataa 
atttacgaat 
atgactatcc 
tcacaaaatc 
acaagactgt 
tggatggttc 
tctgccatga 
tgacaaaacc 
tt ctggaaag 
aagatt tgct 
atctggtttt 
ttttgagaaa 
attttgtggt 
aagaagttct 
caaattttct 
ct tttcggga 
agtcaaatcg 
attttgtaat 
ggaataaagt 
caactagttt 
gtgacccaca 
ctagtcgact 



cacacgtgtt 
aaattttatg 
ccatttcttt 
catgacgtca 
aaaacgagtt 
tttaattcct 
catttttcaa 
ggttttcgga 
ggtttggcca 
gctccggaaa 
tcacggaaaa 
gactgaaatt 
caaagaacta 
tggaaaaaat 
agatcttaaa 
atatttacca 
tgcagaactc 
tgcaaatgcg 
agacattgca 
tgaggaacca 
tgtagcaatt 
acttggagat 
ggtgatccct 
ctcgagaaaa 
aacaatttgg 
cacttatatg 
caaaaaagaa 
aactcatgta 
tatgattcat 
ttatcagaaa 
gggaatcaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1869 



<210> 75 
<211> 622 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 75 

Met Gin Tyr He Val Ala Ser Tyr Phe Thr He Trp Asn Phe Val Asp 
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15 10 15 

His Thr Arg Val Gly Ala Phe Pro Glu Glu Asp Tyr lie Arg Leu Ala 

20 25 30 

Tyr lie lie Gly Gly Asn Phe Met Thr Arg Leu Met Phe Met Gin His 
35 40 45 

Phe Lys Ser Val Leu Lys Tyr Ser Asp His Phe Phe Arg Leu His Leu 
50 55 60 

lie Thr Asp Glu Asn His Arg Ser Asp lie His Glu Leu Met Thr Ser 
65 70 75 80 

Trp Asn lie Ser Asn Cys Glu Trp Phe Phe His Asn Leu Thr Glu Phe 

85 90 95 

Glu Lys Arg Val Ala Trp lie Pro Asn Ser His Tyr Ser Lys Tyr Tyr 
100 105 110 

Gly Leu Ser Lys Leu Leu lie Pro Glu lie lie Gly Asn Asp lie Gly 
115 120 125 

Lys lie Met Phe Met Asp Val Asp lie lie Phe Gin Thr Asn lie Phe 
130 135 140 

Asp Leu Trp Lys Gin Phe Arg Asn Phe Asn Asn Ser Gin Val Phe Gly 
145 150 155 160 

Met Val Glu Asn Leu Ser Asp Trp Tyr Leu Asn Lys Asp Gly Lys Lys 

165 170 175 

Ser Val Trp Pro Ala Leu Gly Arg Gly Phe Asn Thr Gly lie lie Met 
180 185 190 

Phe Asp Leu Asp Lys Leu Arg Lys Asn Gly Trp Ala Ser Lys Trp Arg 
195 200 205 

Val Val Ala Asn Lys Tyr Leu Arg lie His Gly Lys Thr Ala Met Ser 
210 215 220 

Asp Gin Asp He Phe Asn Ala Tyr He His Asp Tyr Pro Thr Glu He 
225 230 235 240 

lie Gin He Pro Cys Ala Tyr Asn Tyr Gin Leu Gly Ala Leu Thr Lys 
245 250 255 

Ser Lys Glu Leu Cys Pro Glu Thr Pro Leu Ala Leu His Phe Asn Ser 
260 265 270 

Gin Asn Lys Thr Val Gly Lys Asn Tyr Ala Phe Phe Asp Lys He Arg 
275 280 285 

Lys Ala Phe Asp Glu Met Asp Gly Ser Asp Leu Lys Arg Arg Arg Arg 
290 295 300 

Ser Phe Lys Gly Asn Asn Gin Lys Asp He Cys His Glu Tyr Leu Pro 
305 310 315 320 
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Leu Asp Asn Phe Arg lie lie Pro Asn Ala lie Gly Arg Met Thr Lys 

325 330 335 

Pro Ala Glu Leu Cys Met Val Thr Gin Phe Ser Lys Asp Arg Leu Asn 
340 345 350 

His Phe Leu Giu Ser Ala Asn Ala Tro Arg His Pro lie Ser Thr Ala 
355 360 " 365 

Val Tyr Gly Lys Asp Lys Asp Leu Leu Asp lie Ala Lys Ala Val Thr 
370 375 380 

Glu Leu Asn Arg Thr Asp lie Thr lie His Leu Val Phe Glu Glu Pro 
385 390 395 400 

Thr Giu Ser Trp Met Leu Asp Ser Leu Tyr Pro lie Asn Phe Leu Arg 

405 410 415 

Asn Val Ala lie Giu His Ala Asn Cys Lys Tyr lie Leu Met Thr Asp 
420 425 430 

Val Asp Phe Val Val Lcl: Gly Asp Tyr Gly Thr lie lie Asp Gin Thr 
435 440 445 

Gly Asn Leu Lys Gin Lys Giu Val Leu Vai lie Pro Aia Leu Glu Met 
450 ' 455 460 

Thr Tyr Pro Gin Leu Arg Leu Asn Leu Ser Asn Phe Leu Ser Arg Lys 
465 470 475 480 

Asp Leu Val lie Giu His Leu Leu Asn Lys Thr lie Gin Thr Phe Arg 

485 490 495 

Giu Thr lie Trp 'pro Ser Ser His Vai Pro Thr Asn lie Ser Lys Trp 
500 505 510 

lie Lys Ser Asn Arq Thr Tyr Met Val Ala Gin Asn Vai Asn Tyr Giu 
515 ' 520 525 

Lys Asn Tyr Glu Pro Tyr Phe Val lie Lys Lys Glu Glu Cys Pro Phe 
530 535 540 

Tyr Asp Gin Arg Phe Giy Gly Phe Gly Trp Asn Lys Vai Thr His Val 
545 550 555 560 

Met Gin Leu Lys Met Mez Asn Tyr Lys Phe Leu Val Ser Pro Thr Ser 

565 570 575 

Phe Met lie His Gin Asn His Asn Aia Ser Lys Ser Leu Lys Arg Trp 
580 585 590 

Arg Arg Asp Pro His Tvr Gin Lys Cys Leu His Thr Leu Lys Asn Lys 
595 ' 600 605 

Phe Met Lys Lys Thr Aia Ser Arg Leu Giy lie Lys Leu Arg 
610 615 620 
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<2 1 0 > 7 6 
<211> 417 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 76 

Met Val Ser Leu Gin Lys Ser lie Gly Leu Leu Leu Leu Ser Ala lie 
15 10 15 

lie Gly Leu Val Phe Leu lie Gin His Arg Lys Ser Tyr Thr Ser Ser 
20 25 30 

Asp Ala Leu Leu Glu Asn Gly Tyr Pro Asn Lys Tyr Tyr Thr lie Glu 
35 40 45 

Asn Pro Ala Glu Glu Gly Glu Arg Arg Ser Tyr Ser lie Gin Thr Glu 
50 55 60 

Met His Ala Asp Gin Tyr Cys lie Ala Tyr Lys Phe Leu Glu Ala Thr 
65 70 75 80 

Glu Ser Phe Arg Glu Ala Asp Gly Leu Glu Pro Val Thr Leu Ala Thr 

85 90 95 

His Ala Thr Ala Asp Met lie Glu Thr Val Glu Asn Met Thr Phe Leu 
100 105 110 

Trp Asp Gly Pro lie Ser lie Gly lie Phe Val Asp Tyr His Ser Tyr 
115 120 125 

Asn Val Leu Glu Tyr Leu Ala Glu Val His Arg Cys Asp Val Ser Phe 
130 135 140 

Arg Arg Lys Met Asn Val His Phe Ala Phe Arg Arg Ser Pro Phe Gin 
145 150 155 160 

Thr Glu Cys Pro Leu lie Glu lie Pro Gin Ser Asn Arg Ser Cys Gin 
165 170 175 

Glu Phe Phe Ala Thr His Thr Glu Leu Arg Asn Ala lie Val Gly Pro 
180 185 190 

Phe Gin Leu Tyr Pro Ser Asn Leu Met Arg Asn lie Ala Arg Lys Gly 
195 200 205 

Ala Gin Thr Asp Leu Gin Phe lie Met Asp Gly Asp Met Val Pro Ser 
210 215 220 

Glu Gly Phe Ala Thr Lys lie Lys Arg lie Ala Asn Glu Val lie Asp 
225 230 235 240 

Gly Lys Asn Lys Arg Val Leu Ala lie Arg Arg Phe Glu Thr Ser Asp 
245 250 255 

Thr Ala Glu lie Pro Arg Asp His Leu Lys Leu Leu Lys Ser Lys Lys 
260 265 270 
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Leu His Lys Thr Phe 
275 

His lie Asp Gly Leu 
290 

Val Val Thr Thr Lys 
305 

Gin Thr lie Leu His 

325 

Ser Arg lie Lys Val 
340 

Gly Tyr Thr Phe His 
355 

lie Lys His Thr Asn 
370 

Glu Ala Tyr Ala Het 
385 

Asp Asp Leu Tyr Pro 

405 

lie 



<210> 77 
<211> 1050 
<212> DNA 
<213> Caenorhabdi tis elegans 

<400> 77 

atgcacgacg agcagttttg tgtrggttat aacttcctgg aggccgaaga tacattccga 60 

gaagacggct tggaaccagt aacrcttgct attcatggca ctccagaagt tttgcaacta 120 

ttggggaata agcctttgaa ctgggatggg cctatttcat ttggtctatt cgtagatttt 180 

cattcccaaa aggccctgaa ttatatttcc atgctacata aatgtgatgc agcttttaaa 240 

agacagatga ctgtccactt tgcattccga atctcaccat ctcaatccga atgcccaatg 3O0 

attcaagttc ttgggtatca ggattgtgcc acatttttac agaaaagcaa gcagctcctt 360 

gaggaaattg aggactcttt tcaaatctac ccgataaacc taatgagaaa tattgctcgg 420 

cgcggagcaa agtcggattt acacttgata atcgatacag atatgatgat gagcaccaac 480 

tttgcaaaga tggtaaaacc aatcgcaaat cggatgattg atgggaagaa taagcaagtg 540 

ttggttgtta gacgtttcga gaccaacgaa aatgagctac caatgagctt tggggatctt 600 

gaggagggaa ttgaaaatca taaaacattc cagttccatc acaaattctt tttcgttggt 660 

catcaaattc ccaacttgat ggaacggttc gaaagatctc acgcctctaa tgatgtggtg 720 

gcatgggaga ttccatacac tggaaatgat tgggaagttc aaatcattct tcaccgcaac 780 

gatccauata atgtagagta ttttcctrcg cgagtcaagg atatgcagtc tttgatctac 840 

aagttatgcc gtgcaaacta caccttcaat ttgctctctc atgtgttcaa tgttcataaa 9O0 

ggaatcaaag aagatgatac catgtactcg aaggttgtga ctgctcacac aaaacggcaa 960 

ggaagattga ggacgctttc tcgatatgtc actgaaattg acaggaaata cccggatacc 1020 

atgaaacgat gtgggcagtt tttgttataa 1050 

<210> 73 
<211> 349 



114 

Glu Phe His His Arg Tyr Phe Pro Glu Gly His 
280 285 

Asp Asp Trp Phe Arg Thr Ser lie His Ser Gly 
295 300 

Glu Val Ala Tyr Pro Giv Tyr Leu Trp Glu Val 
310 315 320 

Arg Asn Asp Pro Tyr Asn Ala Asp Tyr Phe Pro 

330 " 335 

Met His Ser Leu Val Tyr Ala Leu Cys Arg Ala 
345 350 

Val Pro Thr His Val Phe Asp Ser His Arg Gly 
360 365 

Thr lie Tyr Ser Lys Ala Thr lie Ala His Gin 
375 380 

Lys Glu Ala Gly Asp Arg Tyr lie Lys Glu Met 
390 395 400 

His Thr Leu Ser Gin Cys Gly Glu Phe Ser Met 

410 415 



wo 00/73328 



PCT/EPOO/05108 



115 

<212> PRT 

<213> Caenorhabdi t is elegans 
<400> 78 

Met His Asp Glu Gin Phe Cys Val Gly Tyr Asn Phe Leu Glu Ala Glu 
15 10 15 

Asp Thr Phe Arg Glu Asp Gly Leu Glu Pro Val Thr Leu Ala lie His 

20 25 30 

Gly Thr Pro Glu Val Leu Gin Leu Leu Gly Asn Lys Pro Leu Asn Trp 
35 40 45 

Asp Gly Pro lie Ser Phe Gly Leu Phe Val Asp Phe His Ser Gin Lys 
50 55 60 

Ala Leu Asn Tyr lie Ser Met Leu His Lys Cys Asp Ala Ala Phe Lys 
65 70 75 80 

Arg Gin Met Thr Val His Phe Ala Phe Arg lie Ser Pro Ser Gin Ser 

S5 90 95 

Glu Cys Pro Met Tie Gin Val Leu Gly Tyr Gin Asp Cys Ala Thr Phe 
100 105 110 

Leu Gin Lys Ser Lys Gin Leu Leu Glu Glu lie Glu Asp Ser Phe Gin 
115 120 125 

lie Tyr Pro lie Asn Leu Met Arg Asn lie Ala Arg Arg Gly Ala Lys 
130 135 140 

Ser Asp Leu His Leu lie lie Asp Thr Asp Met Met Met Ser Thr Asn 
145 150 155 160 

Phe Ala Lys Met Val Lys Pro lie Ala Asn Arg Met lie Asp Gly Lys 
165 170 175 

Asn Lys Gin Val Leu Val Val Arg Arg Phe Glu Thr Asn Glu Asn Glu 
180 185 190 

Leu Pro Met Ser Phe Gly Asp Leu Glu Glu Gly lie Glu Asn His Lys 
195 200 205 

Thr Phe Gin Phe His His Lys Phe Phe Phe Val Gly His Gin lie Pro 
210 215 220 

Asn Leu Met Glu Trp Phe Glu Arg Ser His Ala Ser Asn Asp Val Val 
225 230 235 240 

Ala Trp Glu lie Pro Tyr Thr Gly Asn Asp Trp Glu Val Gin He He 

245 250 255 

Leu His Arg Asn Asp Pro Tyr Asn Val Glu Tyr Phe Pro Ser Arg Val 
260 265 270 

Lys Asp Met Gin Ser Leu He Tyr Lys Leu Cys Arg Ala Asn Tyr Thr 
275 280 285 
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Phe Asn Leu Leu Ser His Val Phe 
290 295 

ksp Asp Thr Met Tyr Ser Lys Val 
305 310 

Gly Arg Leu Arg Thr Leu Ser Arg 

325 

Tyr .^ro Asp Thr Met Lys Arg Cys 
340 



116 

Asn Val His Lys Gly lie Lys Glu 
300 

Val Thr Ala His Thr Lys Arg Gin 
315 320 

Tyr Val Thr Glu lie Asp Arg Lys 
330 335 

Gly Gin Phe Leu Leu 
345 



' 1 167 

: 1 j - oriA 

•11^' Caenorhabditis elegans 



i:t::r- -acaga tttcctcaag atttactcca tttgctttgt ttctcctatt ttcaattcta 60 

rrtt.TCttgc ggtrtttgaa gaaatattct caagatcttt ctaggatctc tatagaactt 120 

laLgaaaarg agttttgcat tggctacaat ttcctggagg ctacagaaaa attccgagaa 180 

gacggrtrgg agcctgtgac acttgccatt catgggacat ccgatgtcct tgaagtagtg 240 

gagaagaagc catcaaactg ggatgggccr atatcattcg ggatgtttgt tgactatcac 300 

zzcca^aagq ctctggaata tgtggcaatg cttcatcagt gtgataagga gttcggggag 360 

aaagtcaccg ttcactatgt gttccgaact tctccttccc agatggattg tccagtgata 420 

actcctcatg tgtcggtgaa ttgtgatgaa tttcgtcgga atcgaaagca gctcctcaaa 480 

caaataacct ccccgtttca aatctaccca ataaacttga tgagaaatgt tgcccgccgt 540 

cgagcaacrt ctgatctaca cttgatagtc gacgctgata tgacaatgag ctctgatttt 600 

Gcgagaaaag ngaagccaat cgcaaatcgc ataattgatg ggaaacagag acaagttttg 660 

Gtagrtcgac gttrtgagac aaacgaagat gagattccac tggaagttga gcagctgaag 720 

atgggatttg agaatcaaaa agtattcgag ttccatcaca attttttctt tattgggcat 780 

aaaatrccag acgtggaaaa atggtttcac gcctcgaaga cagaaaatga agtgactgcc 840 

tgggaaatrc catactcagg aaatgcatgg gaagtgcaag tgattcttca ccggaacgac 900 

argtacaacg ccgagractt tccgtctaga atccgagaca tgcagtcttt gatctacggt 960 

ctctgccgag ccaactacac cLtcaacttg ctctctcacg tattcaatgt tcaccaaggc 1020 

atcaaagagg atgacacaat gtactcgaaa gttgtcacag ctcactcgaa gcgatatgga 1080 

aggaatagag cattctcccg cracgtccat gagatgaara ctgcgtatcc gggaactatt 1140 
cagcggtgcg ggaagtttga gatgtga 1167 

<210> 80 
<211> 388 
<212> PRT 

<213> Caenorhabdi tis elegans 
<400> 80 

Met Leu Lys lie Ser Ser Arg Phe Thr Pro Phe Ala Leu Phe Leu Leu 
15 10 15 

Phe Ser lie Leu Leu Cys Leu Trp Phe Leu Lys Lys Tyr Ser Gin Asp 
20 25 30 

Leu Ser Arg lie Ser He Glu Leu Tyr Glu Asn Glu Phe Cys He Gly 
35 40 45 



Tyr Asn Phe Leu Glu Ala Thr Glu Lys Phe Arg Glu Asp Gly Leu Glu 
50 55 60 
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Pro Val Thr Leu Ala lie His Gly Thr Ser Asp Val Leu Glu Val Val 
65 70 75 80 

Glu Lvs Lys Pro Ser Asn Trp Asp Gly Pro lie Ser Phe Gly Met. Phe 

85 90 95 

Val Asp Tyr His Ser Gin Lvs Ala Leu Glu Tyr Val Ala Met Leu His 
100 105 110 

Gin Cys Asp Lys Glu Phe Gly Glu Lys Val Thr Val His Tyr Val Phe 
115 120 125 

Arg Thr Ser Pro Ser Gin Met Asp Cys Pro Val lie Thr Pro Asp Val 
130 135 140 

Ser Val Asn Cys Asp Glu Phe Arg Arg Asn Arg Lys Gin Leu Leu Lys 
145 150 155 160 

Glu lie Thr Ser Pro Phe Gin lie Tyr Pro lie Asn Leu Met Arg Asn 

165 170 175 

Val Ala Arg Arg Gly Ala Thr Ser Asp Leu His Leu lie Val Asp Ala 
180 185 190 

Asp Met Thr Met Ser Ser Asp Phe Ala Arg Lys Val Lys Pro lie Ala 
195 200 205 

Asn Arg lie lie Asp Gly Lys Gin Arg Gin Val Leu Val Val Arg Arg 
210 215 220 

Phe Glu Thr Asn Glu Asp Glu lie Pro Leu Glu Val Glu Gin Leu Lys 
225 230 235 240 

Met Gly Phe Glu Asn Gin Lys Val Phe Glu Phe His His Asn Phe Phe 

245 250 255 

Phe lie Gly His Lvs lie Pro Asp Val Glu Lys Trp Phe His Ala Ser 
260 ' 265 270 

Lys Thr Glu Asn Glu Val Thr Ala Trp Glu lie Pro Tyr Ser Gly Asn 
275 280 285 

Ala Trp Glu Val Gin Val lie Leu His Arg Asn Asp Met Tyr Asn Ala 
290 295 300 

Glu Tyr Phe Pro Ser Arg He Arg Asp Met Gin Ser Leu He Tyr Gly 
305 310 315 320 

Leu Cys Arg Ala Asn Tyr Thr Phe Asn Leu Leu Ser His Val Phe Asn 

325 330 335 

Val His Gin Gly He Lys Glu Asp Asp Thr Met Tyr Ser Lys Val Val 
340 345 350 

Thr Ala His Ser Lys Arg Tyr Gly Arg Asn Arg Ala Phe Ser Arg Tyr 
355 360 365 

Val His Glu Met Asn Thr Ala Tyr Pro Gly Thr He Gin Arg Cys Gly 
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370 375 380 

Lys Phe Glu Met 
385 



<210> 81 
<211> 1275 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 81 

atgtgtactt ttaaaaaatt cgatggagaa actcgaaaaa ctcggatcca gattttgtat 60 

tttgctgcat cactcgttaa tctcgatctc aagcctgtga agctaaattc caatgcgaat 120 

atatgcgtca aaattgaaac ttctcacttt acctctggca cttattatat taacttggca 180 

tctgtacaat tcaaaggtaa tgctcctggt tctgatgctg aaggaaggtt tttcaagaaa 240 

ctacacggaa aaccagaaaa taattataat tccttacaga cgactgtaca ctttgcattc 300 

cgaatctcac catctcaaac cgaatgtcct gtgatctata cttccgggta taaggattgt 360 

gtcacgttrt tccaaaagaa cacagagctc cttgaggaaa tggaggaccc ttttcagatc 420 

tacccgataa atctaatgag aaatattgct cgacgcggag caaagtcgga tttacacttg 480 

atagtcgana cagatatggt aatgagtact aactttgcaa agatggtaaa accagttgcg 540 

aaccgganga ttgatgggat gaataaacaa gtcttggttg ttcgacgctt cgagaccaac 600 

gaaaccgaac ttccactgaa cttggacgaa cttgagcaag ggcttctgaa tgagaacaca 660 

tutgaattcc atcactcgtt cttttttgtt ggccatcaaa tacccaactt gtctgagtgg 720 

tttgaaaatt cttacgcatc agaagaaacc actgcatggg agattccata cacaggaagt 780 

gattgggaag ttcaaataat tcttcaccgc aacgacccat ataacattga gtacttccca 840 

tcgcgagtca gggatatgca gtctttgatt tataaactct gccgtgcgaa ctacacattc 900 

aatttgctct ctcacgtatt caatgttcac aaggggatca aagaagarga tacaatgtac 960 

tcgaaagncg tcactgctca cacaaagcaa ttttggaaaa tgaggtattt atttttttgt 1020 

tgtagagaat tcccaagata tgcttgtgaa tttacagaac gctttcccgt tacactgccg 1080 

aaatcgacaa gcagtaccca gacactacaa caagataatt tgccagatgt ttccttattt 1140 

ttttcaggag tattcagaat gttcacgcaa ttctcgaaat tttcagagca tttgaacatt 1200 

tttaaagccg gaaaagcgta ctgttttgtr gtttctgtca cttttctggt gtctttaaaa 1260 

tatggagaaa aataa 1275 

<210> 82 
<211> 424 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 82 

Met Cys Thr Phe Lys Lys Phe Asp Gly Glu Thr Arg Lys Thr Arg lie 
15 10 15 

Gin lie Leu Tyr Phe Ala Aia Ser Leu Val Asn Leu Asp Leu Lys Pro 
20 25 30 

Val Lys Leu Asn Ser Asn Ala Asn He Cys Val Lys He Glu Thr Ser 
35 40 45 

His Phe Thr Ser Gly Thr Tvr Tyr He Asn Leu Aia Ser Val Gin Phe 
50 55 60 

Lys Gly Asn Ala Pro Gly Ser Asp Ala Glu Gly Arg Phe Phe Lys Lys 
65 70 75 80 



Leu His Gly Lys Pro Glu Asn Asn Tyr Asn Ser Leu Gin Thr Thr Val 

85 90 95 
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'x b Filti Alci Put; Ary 1 It; 3t;r Fro 3€rr Gin Tnr Gj-U Cys Pro Val I ic 
100 105 110 

^vr Thr Ser Gly Tyr Lys Asp Cys Val Thr Phe Phe Gin Lys Asn Thr 
115 120 125 

Leu Leu Glu Glu Met Glu Asp Pro Phe Gin lie Tyr Pro lie Asn 
130 135 140 

„eu Met Arg Asn lie Ala Arg Arg Gly Ala Lys Ser Asp Leu His Leu 
1^5 150 155 160 

\'al Asp Thr Asp Met Val Met Ser Thr Asn Phe Ala Lys Met Val 
165 170 175 

:.\-s Fro Val Ala Asn Arg Met lie Asp Gly Met Asn Lys Gin Val Leu 
180 185 190 

.'dl V^I Arg Arg Phe Glu Thr Asn Glu Thr Glu Leu Pro Leu Asn Leu 
195 200 205 

Aso Glu Leu Glu Gin Gly Leu Leu Asn Glu A.sn Thr Phe Glu Phe His 
21C 215 220 

riis Ser Phe Phe Phe Val Gly His Gin lie Pro Asn Leu Ser Glu Trp 
225 230 235 240 

Phe Glu Asn Ser Tyr Ala Ser Glu Glu Thr Thr Ala Trp Glu lie Pro 
245 250 255 

Tvr Thr Gly Ser Asp Trp Glu Val Gin lie lie Leu His Arg Asn Asp 
260 ' 265 270 

Pro Tyr Asn lie Glu Tyr Phe Pro Ser Arg Val Arg Asp Met Gin Ser 
275 280 285 

Leu lie Tyr Lys Leu Cys Arg Ala Asn Tyr Thr Phe Asn Leu Leu Ser 
290 ^ 295 300 

His Val Phe Asn Val His Lys Gly lie Lys Glu Asp Asp Thr Met Tyr 
305 310 315 320 

Ser Lvs Val Val Thr Ala His Thr Lys Gin Phe Trp Lys Met Arg Tyr 

325 330 335 

Leu Phe Phe Cys Cys Arg Glu Phe Pro Arg Tyr Ala Cys Glu Phe Thr 
340 345 350 

Glu Arg Phe Pro Val Thr Leu Pro Lys Ser Thr Ser Ser Thr Gin Thr 
355 360 365 

Leu Gin Gin Asp Asn Leu Pro Asp Val Ser Leu Phe Phe Ser Gly Val 
370 375 380 

Phe Arg Met Phe Thr Gin Phe Ser Lys Phe Ser Glu His Leu Asn lie 
385 390 395 400 



wo 00/73328 PCT/EPOO/051 08 

120 

Phe Lys Ala Gly Lys Ala Tyr Cys Phe Val Val Ser Val Thr Phe Leu 
405 410 415 

Val Ser Leu Lys Tyr Gly Glu Lys 
420 

<210> 83 
<211> 370 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 83 

Met Glu Asp Asp Thr Pro Asp Val Ser Ser Asp Ser Asn Gly Asp Ala 
15 10 15 

Ala Tyr Ser Asp Tyr Phe Leu Asp Tyr Lys Ser He Met Asp Glu He 
20 25 30 

Thr He Thr Thr Gin Pro Lys Ser Gly Tyr Val He Arg Asn Lys Pro 
35 40 45 

Leu Arg Leu Gin Cys Arg Ala Asn His Ala Thr Lys He Arg Tyr Lys 
50 55 60 

Cys Ser Ser Lys Trp He Asp Asp Ser Arg He Glu Lys Leu He Gly 
^5 70 75 80 

Thr Asp Ser Thr Ser Gly Val Gly Tyr He Asp Ala Ser Val Asp He 

85 90 95 

Ser Arg He Asp Val Asp Thr Ser Gly His Val Asp Ala Phe Gin Cys 
100 105 110 

Gin Cys Tyr Ala Ser Gly Asp Asp Asp Gin Asp Val Val Ala Ser Asp 
115 120 125 

Val Ala Thr Val His Leu Ala Tyr Met Arg Lys His Phe Leu Lys Ser 
130 135 140 

Pro Val Ala Gin Arg Val Gin Glu Gly Thr Thr Leu Gin Leu Pro Cys 
145 150 155 160 

Gin Ala Pro Glu Ser Asp Pro Lys Ala Glu Leu Thr Trp Tyr Lys Asp 
165 170 175 

Gly Val Val Val Gin Pro Asp Ala Asn Val He Arg Ala Ser Asp Gly 
180 185 190 

Ser Leu He Met Ser Ala Ala Arg Leu Ser Asp Ser Gly Asn Tyr Thr 
195 200 205 

Cys Glu Ala Thr Asn Val Ala Asn Ser Arg Lys Thr Asp Pro Val Glu 
210 215 220 

Val Gin He Tyr Val Asp Gly Gly Trp Ser Glu Trp Ser Pro Trp He 
225 230 235 240 
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Gly Thr Cys His Val Asp Cys Pro Leu Leu Arg Gin His Ala His Arg 
245 250 255 

lie Arg Asp Pro His Asp Val Leu Pro His Gin Arq Arg Thr Arg Thr 
260 265 ' 270 

Cys Asa Asn Pro Ala Pro Leu Asn Asp Gly Glu Tyr Cys Lys Gly Glu 
275 280 285 

Glu Glu Met Thr Arg Ser Cys Lys Val Pro Cys Lys Leu Asp Gly Gly 
290 295 300 

Trp Ser Ser Trp Ser Asp Tro Ser Ala Cys Ser Ser Ser Cys His Arg 
305 310 ' 315 320 

Tyr Arg Thr Arg Ala Cys Thr Val Pro Pro Pro Met Asn Gly Gly Gin 
325 330 335 

Pro Cys Phe Gly Asp Asp Leu Met Thr Gin Glu Cys Pro Ala Gin Leu 
340 ^ 345 350 

Cys Thr Ala Asp Ser Ser Arg lie Val lie Ser Asp Thr Ala Val Tyr 
355 360 365 

Gly Ser 
370 



<210> 84 
<211> 20 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 84 

Val Ala Ser lie Phe lie Val Ala Ser Phe lie Leu Ala lie Leu Ala 
15 10 15 

Met Phe Cys Cys 
20 



<210> 85 
<211> 122 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 85 

Lys Arg Gly Asn Ser Lys Lys Ser Lys Pro Leu Lys Pro Gin Lys Met 
15 10 15 

Asn Ser Glu Lys Ala Gly Gly He Tyr Tyr Ser Glu Pro Pro Gly Val 
20 25 30 

Arg Arg Leu Leu Leu Glu His Gin His Gly Thr Leu Leu Gly Glu Lys 
35 40 45 

He Ser Ser Cys Ser Gin Tyr Phe Glu Pro Pro Pro Leu Pro His Ser 
50 55 60 
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Thr Thr Leu Arg Ser Gly Lys Ser 
65 70 

Arg Asn Ala Gly Ser Arg Ala Ala 

85 

Ser Ser Gly Ser Gly Glv Lys Arg 
ICO 

Asn Cys Ser Asp Asp Asp Asn Tyr 
115 120 



122 

Ala Phe Ser Gly Tyr Ser Ser Thr 
75 80 

Leu lie Gin Glu Cys Ser Ser Ser 
90 95 

Thr Met Leu Arg Thr Ser Ser Ser 
105 110 

Ala Thr 



<210> 86 
<211> 165 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 86 

Leu Tyr Asp Tyr Met Glu Asp Lys Ser Val Leu Gly Leu Asp Thr Ser 
15 10 15 

Gin Asn lie Val Ala Ala Gin lie Asp Ser Asn Gly Ala Arg Leu Ser 
20 25 30 

Leu Ser Lys Ser Gly Ala Arg Leu lie Val Pro Glu Leu Ala Val Glu 
35 40 45 

Gly Glu Lys Met Leu Tyr Leu Ala Val Ser Asp Thr Leu Thr Asp Gin 
50 55 60 

Pro His Leu Lys Pro lie Glu Ser Ala Leu Ser Pro Val lie Val lie 
65 70 75 80 

Gly Gin Cys Asp Val Ser Met Ser Ala His Asp Asn lie Leu Arg Arg 

85 90 95 

Pro Val Val Val Ser Phe Arg His Cys Ala Ser Thr Phe Pro Arg Asp 
100 105 110 

Asn Trp Gin Phe Thr Leu Tyr Ala Asp Glu Gly Ser Gly Trp Gin Lys 
115 120 125 

Ala Val Thr He Gly Glu Glu Asn Leu Asn Thr Asn Met Phe Val Gin 
130 135 140 

Phe Glu Gin Pro Gly Lys Lys Asn Asp Gly Phe Gly Trp Cys His Val 
145 150 155 160 

Met Thr Tyr Ser Leu 
165 



<210> 87 
<211> 157 
<212> PRT 

<213> Caenorhabditis elegans 
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Ala Arg Leu Met Leu Ala Gly His Pro Arg Arg Asn Ser Leu Ser Ala 
15 10 15 

Ala Lys Arg Val His Leu Ala Val Phe Gly Pro Thr Glu Met Ser Ala 
20 25 30 

Tyr Arg Arg Pro Phe Glu Leu Arg Val Tyr Cys Val Pro Glu Thr Gly 
35 40 45 

Ala Ala Met Glu Ser Val Trp Lys Gin Glu Asp Gly Ser Arg Leu Leu 
50 55 60 

Cys Glu Ser Asn Asp Phe lie Leu Asn Glu Lys Gly Asn Leu Cys lie 
65 70 75 80 

Cys lie Glu Asp Val lie Pro Gly Phe Ser Cys Asp Gly Pro Glu Val 

85 90 95 

Val Glu lie Ser Glu Thr Gin His Arg Phe Val Ala Gin Asn Gly Leu 
100 105 110 

His Cys Ser Leu Lys Phe Arg Pro Lys Glu lie Asn Gly Ser Gin Phe 
115 120 125 

Ser Thr Arg Val lie Val Tyr Gin Lys Ala Ser Ser Thr Glu Pro Met 
130 135 140 

Val Met Glu Val Ser Asn Glu Pro Glu Leu Tyr Asp Ala 
145 150 155 



<210> 88 
<211> 113 
<212> PRT 

<21 3> Caenorhabditis elegans 
<400> 88 

Thr Ser Glu Glu Arg Glu Lys Gly Ser Val Cys Val Glu Phe Arg Leu 
15 10 15 

Pro Phe Gly Val Lys Asp Glu Leu Ala Arg Leu Leu Asp Met Pro Asn 
20 25 30 

Glu Ser His Ser Asp Trp Arg Gly Leu Ala Lys Lys Leu His Tyr Asp 
35 40 45 

Arg Tyr Leu Gin Phe Phe Ala Ser Phe Pro Asp Cys Ser Pro Thr Ser 
50 55 60 

Leu Leu Leu Asp Leu Trp Glu Ala Ser Ser Ser Gly Ser Ala Arg Ala 
65 70 75 80 

Val Pro Asp Leu Leu Gin Thr Leu Arg Val Met Gly Arg Pro Asp Ala 

85 90 95 

Val Met Val Leu Glu Arg Phe Leu Ser Ala Phe Pro Gin lie Val Ser 
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100 



105 



110 



Pro 



<210> 89 
<211> 437 
<212> PRT 

<213> Homo sapiens 
<400> 89 

His Met Ala Thr Leu His His Ser Ser Pro Thr Ser Glu Ala Glu Glu 
15 10 15 

Phe Val Ser Arg Leu Ser Thr Gln^Asn Tyr Phe Arg Ser Leu Pro Arg 
20 25 30 

Gly Thr Ser Asn Met Thr Tyr Gly Thr Phe Asn Phe Leu Gly Giy Arg 
35 40 45 

Leu Met lie Pro Asn Thr Gly lie Ser Leu Leu lie Pro Pro Asp Ala 
50 55 60 

lie Pro Arg Gly Lys lie Tyr Glu lie Tyr Leu Thr Leu His Lys Pro 
65 70 75 80 

Glu Asp Val Arg Leu Pro Leu Ala Gly Cys Gin Thr Leu Leu Ser Pro 

85 90 95 

lie Val Ser Cys Gly Pro Pro Gly Val Leu Leu Thr Arg Pro Val lie 
100 105 110 

Leu Ala Met Asp His Cys Gly Glu Pro Ser Pro Asp Ser Trp Ser Leu 
115 120 125 

Arg Leu Lys Lys Gin Ser Cys Glu Gly Ser Trp Glu Asp Val Leu His 
130 135 140 

Leu Gly Glu Glu Ala Pro Ser His Leu Tyr Tyr Cys Gin Leu Glu Ala 
145 150 155 160 

Ser Ala Cys Tyr Val Phe Thr Glu Gin Leu Gly Arg Phe Ala Leu Val 
165 170 175 

Gly Glu Ala Leu Ser Val Ala Ala Ala Lys Arg Leu Lys Leu Leu Leu 
180 185 190 

Phe Ala Pro Val Ala Cys Thr Ser Leu Glu Tyr Asn He Arg Val Tyr 
195 200 205 

Cys Leu His Asp Thr His Asp Ala Leu Lys Glu Val Val Gin Leu Glu 
210 215 220 

Lys Gin Leu Gly Gly Gin Leu He Gin Glu Pro Arg Val Leu His Phe 
225 230 235 240 

Lys Asp Ser Tyr His Asn Leu Arg Leu Ser He His Asp Val Pro Ser 
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245 250 255 

Ser Leu Trp Lys Ser Lys Leu Leu Val Ser Tyr Gin Glu lie Pro Phe 

260 265 270 

Tyr His lie Trp Asn Gly Thr Gin Arg Tyr Leu His Cys Thr Phe Thr 
275 " 280 285 

Leu Glu Arg Val Ser Pro Ser Thr Ser Asp Leu Ala Cys Lys Leu Trp 
290 295 300 

Val Trp Gin Val Glu Gly Asp Gly Gin Ser Phe Ser lie Asn Phe Asn 
305 310 315 320 

lie Thr Lys Asp Thr Arg Phe Ala Glu Leu Leu Ala Leu Glu Ser Glu 

325 330 335 

Ala Gly Val Gin Ala Leu Val Gly Pro Ser Ala Phe Lys lie Pro Phe 
340 345 350 

Leu lie Arg Gin Lys lie lie Ser Ser Leu Asp Pro Pro Cys Arg Arg 

355 360 365 

Gly Ala Asp Trp Arg Thr Leu Ala Gin Lys Leu His Leu Asp Ser His 
370 375 380 

Leu Ser Phe Phe Ala Ser Lys Pro Ser Pro Thr Ala Met lie Leu Asn 
385 390 395 400 

Leu Trp Glu Ala Arg His Phe Pro Asn Gly Asn Leu Ser Gin Leu Ala 

405 410 415 

Ala Ala Val Ala Gly Leu Gly Gin Pro Asp Ala Gly Leu Phe Thr Val 
420 425 430 

Ser Glu Ala Glu Cys 
435 



<210> 90 
<211> 931 
<212> PRT 

<2 1 3> Homo sapiens 
<400> 90 

Met Arg Lys Gly Leu Arg Ala Thr Ala Ala Arg Cys Gly Leu Gly Leu 
15 10 15 

Gly Tyr Leu Leu Gin Met Leu Val Leu Pro Ala Leu Ala Leu Leu Ser 
20 25 30 

Ala Ser Gly Thr Gly Ser Ala Ala Gin Aso Asp Asp Phe Phe His Glu 
35 40 45 

Leu Pro Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe 
50 55 60 

Leu lie Glu Pro Glu Glu Ala Tyr lie Val Lys Asn Lys Pro Val Asn 
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65 70 75 80 

Leu Tyr Cys Lys Ala Ser Pro Ala Thr Gin lie Tyr Phe Lys Cys Asn 

85 90 95 

Ser Glu Trp Val His Gin Lys Asp His lie Val Asp Glu Arg Val Asp 
100 105 110 

Glu Thr Ser Gly Leu lie Val Arg Glu Val Ser lie Glu lie Ser Arg 
115 120 125 

Gin Gin Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys 
130 135 140 

Val Ala Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val 
145 150 155 160 

Arg lie Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys 

165 170 175 

Glu Val Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Giu 
180 185 190 

Gly lie Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp lie lie 
195 200 205 

Asp Pro Val Glu Asp Arg Asn Phe Tyr lie Thr lie Asp His Asn Leu 
210 215 220 

lie lie Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val 
225 230 235 240 

Ala Lys Asn lie Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val lie 
245 250 255 

Val Tyr Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys 
260 265 270 

Asn Ser Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr 
275 280 285 

Asn Pro Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val 
290 295 300 

Gin Lys lie Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr 
305 310 315 320 

Pro Trp Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg 
325 330 335 

Arg Arg Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys 
340 345 350 

Asp Gly Leu' Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met 
355 360 365 

Gin Thr Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly lie Val 
370 375 380 
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lie Ala val lie Vai Cys Leu Ala 
385 390 

Val Tyr Arg Lys Asn His Arg Asp 

405 

Ser Ala Leu Asn Gly Gly Phe Gin 
420 

Gin Asp Leu Leu Ala Val Pro Pro 

435 440 

Tyr Arg Gly Pro Val 
450 

Met Thr Asn Ser Pro lie Leu Asp 
465 470 

Vai Tyr Asn Thr Ser Gly Ala Val 

485 

Phe Thr Ser Lys Leu Ser 
500 

Glu Ala Leu Ser Leu Lys Asn Gin 
515 ^ 520 



127 

lie Ser Vai Vai Val Ala Leu Phe 
395 400 

Phe Glu Ser Asp lie lie Asp Ser 
410 415 

Pro Val Asn lie Lys Ala Ala Arg 
425 ' 430 

Asp Leu Thr Ser Ala Ala Ala Met 

445 

Val Ser Asp Lys lie Pro 
460 

Pro Leu Pro Asn Leu Lys lie Lys 

475 480 

Ser Pro Gin Asp Asp Leu Ser Glu 
490 495 

Gin Ser Leu Leu Glu Asn 
510 

Ser Leu Ala Arg Gin Thr Asp Pro 

525 



Tyr Ala Leu His Asp 
455 



Pro Gin Met Thr 
505 



Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu lie 

530 535 540 

Val Pro Asn Ser Gly Val Ser Leu Leu lie Pro Ala Gly Ala lie Pro 

545 ^ 550 555 560 

Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu Thr 

565 570 575 

Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val Val 

580 ' 585 590 

Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu Thr 

595 600 605 

Met His His Cys Ala Aso Pro Asn Thr Glu Asp Trp Lys lie Leu Leu 

610 * 615 620 

Lys Asn Gin Ala Ala Gin Gly Gin Trp Glu Asp Val Val Val Val Gly 

625 630 635 640 

Glu Glu Asn Phe Thr Thr Pro Cys Tyr lie Lys Leu Asp Ala Glu Ala 

645 650 655 

Cys His lie Leu Thr Glu Asn Leu Ser Thr Tyr Ala Leu Val Gly His 

660 665 670 

Ser Thr Thr Lys Ala Ala Ala Lys Arg Leu Lys Leu Ala lie Phe Gly 

675 680 685 
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Pro Leu Cys Cys Ser Ser Leu Glu Tyr Ser lie Arg Val Tyr Cys Leu 
690 695 700 

Asp Asp Thr Gin Asp Ala Leu Lys Glu lie Leu His Leu Glu Arg Gin 
■705 710 715 720 

rhr Giy Gly Gin Leu Leu Glu Glu Pro Lys Ala Leu His Phe Lys Giy 

725 730 735 

.^"or Tnr His Asn Leu Arg Leu Ser lie His Asp lie Ala His Ser Leu 
740 745 750 

1 :;- ^ys Ser Lys Leu Leu Ala Lys Tyr Gin Glu lie Pro Phe Tyr His 
755 760 765 

'."o: Tr:. Ser Gly Ser Gin Arg Asn Leu His Cys Thr Phe Thr Leu Glu 

775 780 

y-.r:i Fr.*^ Sor Leu Asn Thr Val Glu Leu Val Cys Lys Leu Cys Val Arg 

790 795 800 

CI- Glu Gly Glu Gly Gin lie Phe Gin Leu Asn Cys Thr Val Ser 

805 810 815 

Glu G-u Pro Thr Gly lie Asp Leu Pro Leu Leu Asp Pro Ala Asn Thr 
820 825 830 

He Tnr Thr Val Thr Gly Pro Ser Ala Phe Ser He Pro Leu Pro He 
635 840 845 

Arg Gin Lys Leu Cys Ser Ser Leu Asp Ala Pro Gin Thr Arg Gly His 
c50 855 860 

Asp Trp Arg Mez Leu Ala His Lys Leu Asn Leu Asp Arg Tyr Leu Asn 
865 870 875 880 

Tyr ?ne Ala Thr Lys Ser Ser Pro Thr Gly Val He Leu Asp Leu Trp 
885 890 895 

Glu Ala Gin Asn Phe Pro Asp Gly Asn Leu Ser Met Leu Ala Ala Val 
900 905 910 

Leu Glu Glu Met Gly Arg His Glu Thr Val Val Ser Leu Ala Ala Glu 
915 920 925 

Gly Gin Tyr 
930 



<210> 91 
<211> 9700 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
PGC1037 
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tatggccaca ctgcaccaca gctctcccac 
ctccacccag aactacttcc gctccctgcc 
cttcaactnc ctcgggggcc ggctgatgat 
cccagatgcc ataccccgag ggaagatcta 
agacgtgagg ttgcccctag ctggctgtca 
accccczgqc gtcctgctca cccggccagt 
cagcccrgac agctggagcc tgcgcctcaa 
tgtgctgcac ccgggcgagg aggcgccctc 
tgcctgcnac gtcttcaccg agcagctggg 
cgtggctgcc gccaagcgcc tcaagctgct 
cgagtacaac atccgggtct actgcctgca 
gcagctggag aagcagctgg ggggacagct 
ggacagtrac cacaacctgc gcctatccat 
taagctcctt gtcagctacc aggagatccc 
gtacttgcac tgcaccttca ccctggagcg 
caagctgtgg gtgtggcagg tggagggcga 
caccaaggac acaaggtttg ctgagctgct 
cctggtgggc cccagtgcct tcaagatccc 
cctggaccca ccctgtaggc ggggtgccga 
ggacagccat ctcagcttct ttgcctccaa 
gtgggaggcg cggcacttcc ccaacggcaa 
actgggccag ccagacgctg gcctcttcac 
gcagccaagc taattccggg cgaatttctt 
tataaaaaaa ataagtgtat acaaatttta 
cttgttcttg agtaactctt tcctgtaggt 
cgctcttatt gaccacacct ctaccggcat 
tttcctgrgt gaaattgtta tccgctcaca 
aagtgtaaag cctggggtgc ctaatgagtg 
ctgcccgctt tccagtcggg aaacctgtcg 
gcggggagag gcggtttgcg tattgggcgc 
cgctcggtcg ttcggctgcg gcgagcggta 
tccacagaat caggggataa cgcaggaaag 
aggaaccgta aaaaggccgc gttgctggcg 
catcacaaaa atcgacgctc aagtcagagg 
caggcgtttc cccctggaag ctccctcgtg 
ggatacctgt ccgcctttct cccttcggga 
aggtatctca gttcggtgta ggtcgttcgc 
gttcagcccg accgctgcgc cttatccggt 
cacgacttat cgccactggc agcagccact 
ggcggtgcra cagagttctt gaagtggtgg 
tttggtatct gcgctctgct gaagccagtt 
tccggcaaac aaaccaccgc tggtagcggt 
cgcagaaaaa aaggatctca agaagatcct 
tggaacgaaa actcacgtra agggattttg 
tagatccttt taaattaaaa atgaagtttt 
tggtctgaca gttaccaatg cttaatcagt 
cgttcatcca tagttgcctg actccccgtc 
ccatctggcc ccagtgctgc aatgataccg 
tcagcaataa accagccagc cggaagggcc 
qcczccB'ccc agtctattaa ttgttgccgg 
agtttgcgca acgttgttgc cattgctaca 
atggcttcat tcagctccgg ttcccaacga 
tgcaaaaaag cggttagctc cttcggtcct 
gtgttatcac tcatggttat ggcagcactg 
agatgctttt ctgtgactgg tgagtactca 
cgaccgagtt gctcttgccc ggcgtcaata 
ttaaaagrgc tcancattgg aaaacgttct 



129 

ctctgaggcc gaggagttcg tctcccgccL 60 
ccgaggcacc agcaacatga cctatgggac 120 
ccctaataca ggaatcagcc tcczcazccc 180 
tgagatctac ctcacgctgc acaagccgga 240 
gaccctgctg agtcccatcg ttagctgtgg 300 
catcctggct atggaccact gtggggagcc 360 
aaagcagtcg tgcgagggca gctgggagga 420 
ccacctctac tactgccagc tggaggccag 480 
ccgctttgcc ctggrgggag aggccctcag 540 
tctgtttgcg ccggtggcct gcacctccct 600 
tgacacccac gatgcactca aggaggtggt 660 
gatccaggag ccacgggtcc tgcacttcaa 720 
ccacgatgtg cccagctccc tgtggaagag 780 
cttttatcac atctggaatg gcacgcagcg 840 
tgtcagcccc agcactagtg acctggcctg 900 
cgggcagagc ttcagcatca acttcaacat 960 
ggctctggag agtgaagcgg gggtccaagc 1020 
cttcctcatt cggcagaaga taatttccag 1080 
ctggcggact ctggcccaga aactccacct 1140 
gcccagcccc acagccatga tcctcaacct 1200 
cctcagccag ctggctgcag cagtggctgg 1260 
agtgtcggag gctgagtgct gagtcgacct 1320 
atgatttatg atttttatta ttaaataagt 1380 
aagtgactct taggttttaa aacgaaaatt 1440 
caggttgctt tctcaggtat agcatgaggt 1500 
gcaagcttgg cgtaatcatg gtcatagctg 1560 
attccacaca acatacgagc cggaagcata 1620 
aggtaactca cattaattgc gttgcgctca 1680 
tgccagctgg attaatgaat cggccaacgc 1740 
tcttccgctt cctcgctcac tgactcgctg 1800 
tcagctcact caaaggcggt aatacggtta 1860 
aacatgtgag caaaaggcca gcaaaaggcc 1920 
tttttccata ggctccgccc ccctgacgag 1980 
tggcgaaacc cgacaggact ataaagatac 2040 
cgctctcctg ttccgaccct gccgcttacc 2100 
agcgtggcgc tttctcatag ctcacgctgt 2160 
tccaagctgg gctgtgtgca cgaacccccc 2220 
aactatcgtc ttgagtccaa cccggtaaga 2280 
ggtaacagga ttagcagagc gaggtatgta 2340 
cctaactacg gctacactag aaggacagta 2400 
accttcggaa aaagagttgg tagctcttga 2460 
ggtttttttg tttgcaagca gcagattacg 2520 
ttgatctttt ctacggggtc tgacgctcag 2580 
gtcargagat tatcaaaaag gatcttcacc 2640 
aaatcaatct aaagtatata tgagtaaact 2700 
gaggcaccta tctcagcgat ctgtctattt 2760 
gtgragataa ctacgatacg ggagggctta 2820 
cgagacccac gctcaccggc tccagattta 2880 
gagcgcagaa gtggtcctgc aactttatcc 2940 
gaagctagag taagtagttc gccagttaat 3000 
ggcatcgtgg tgtcacgctc gtcgtttggt 3060 
tcaaggcgag ttacatgatc ccccatgttg 3120 
ccgatcgttg tcagaagtaa gttggccgca 3180 
cataattctc ttactgtcat gccatccgta 3240 
accaagtcat tctgagaata gtgtatgcgg 3300 
cgggataata ccgcgccaca tagcagaact 3360 
tcggggcgaa aactctcaag gatcttaccg 3420 
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ctgttgagat ccagttcgat gtaacccact 
actttcacca gcgtttctgg gtgagcaaaa 
ataagggcga cacggaaatg ttgaatactc 
atttatcagg gttattgtct catgagcgga 
caaatagggg ttccgcgcac atttccccga 
tcattttgta gaacaaaaat gcaacgcgag 
ctgcattttt acagaacaga aatgcaacgc 
tgcttcattt ttgtaaaaca aaaatgcaac 
ctgagctgca tttttacaga acagaaacgc 
atctatactt cttttttgtt ctacaaaaat 
gcatcttaga ttactttttt tctcctttgt 
tttgcactgt aggtccgtta aggttagaag 
ataaaaaaag cctgactcca cttcccgcgt 
rtttttcaag ataaaggcat ccccgattat 
tgtgaacaga aagtgatagc gttgatgatt 
ctcctatttt gtctctatat actacgtata 
attcactcta tgaatagttc ttactacaat 
aacataaaaa atgtagaggt cgagtttaga 
taggttatat agggatatag cacagagata 
cgtggaagcg gtattcgcaa tattttagta 
tttgaaagtg cgtcttcaga gcgcttttgg 
ttctagagaa taggaacttc ggaataggaa 
cgaaaatgca acgcgagctg cgcacataca 
gtgttgcctg tatatatata tacatgagaa 
gcgtacttat atgcgtctat ttatgtagga 
tatcccattc catgcggggt atcgtatgct 
atgctgccac tccccaattg gattagtctc 
tggatcatat taagaaacca ttattatcat 
gaggcccttt cgtctcgcgc gtttcggtga 
cccggagacg gtcacagctt gtctgtaagc 
cgcgtcagcg ggtgttggcg ggtgtcgggg 
tgtactgaga gtgcaccata gatcaacgac 
aatagacagc atcgtaatat atgtgtactt 
aagatattct ttattgaaaa atagcttgtc 
tctttttttg ccgattaaga attaattcgg 
gagggcattg gtgactattg agcacgtgag 
ggagtatgtc tgttattaat ttcacaggta 
tgcagagcac agaggccgca gaatgtgctc 
tatgtgtgcc caatagaaag agaacaattg 
ttgtaaaagc atataaaaat agttcaggca 
atcaacctaa ggaggatgtt ttggctctgg 
tgcatggaga tgagtcgtgg caagaatacc 
gactcgtatt tccaaaagac tgcaacatac 
cgtttattcc cttgtttgat tcagaagcag 
cgatttctga ctgggttgga aggcaagaga 
gtggactgac gccagaaaat gttggtgatg 
atgtaagcgg aggtgtggag acaaatggtg 
tcaaaaatgc taagaaatag gttattactg 
cttgccgatc tatgcggtgt gaaataccgc 
ggaaattgta aacgttaata ttttgttaaa 
attttttaac caataggccg aaatcggcaa 
gatagggttg agtgttgttc cagtttggaa 
caacgtcaaa gggcgaaaaa ccgtctatca 
ctaatcaagt tttttggggt cgaggtgccg 
cccccgattt agagcttgac ggggaaagcc 
agcgaaagga gcgggcgcta gggcgctggc 
cacacccgcc gcgcttaatg cgccgctaca 
gcgcaactgt tgggaagggc gatcggtgcg 
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cgtgcaccca actgatcttc agcatctttt 34S: 
acaggaaggc aaaatgccgc aaaaaaggga 3540 
atactcttcc tttttcaata ttattgaagc 3600 
tacatatttg aatgtattta gaaaaataaa 3660 
aaagtgccac ctgaacgaag catctgtgct 3720 
agcgctaatt tttcaaacaa agaatctgag 3780 
gaaagcgcta ttttaccaac gaagaatctg 3840 
gcgagagcgc taatttttca aacaaagaat 3900 
aacgcgagag cgctatttta ccaacaaaga 3960 
gcatcccgag agcgctartt ttctaacaaa 4020 
gcgctctata atgcagtctc ttgataactt 4080 
aaggctactt tggtgtctat tttctcttcc 4140 
ttactgatta ctagcgaagc tgcgggtgca 4200 
attctatacc gatgtggatt gcgcatactt 4260 
cttcattggt cagaaaatta tgaacggttt 4320 
ggaaatgttt acattttcgt attgttttcg 4380 
ttttttgtct aaagagtaat actagagata 4440 
tgcaagttca aggagcgaaa ggtggatggg 4500 
tatagcaaag agatactttt gagcaatgtt 4560 
gctcgttaca gtccggtgcg tttttggttt 4620 
ttttcaaaag cgctctgaag ttcctatact 4680 
cttcaaagcg tttccgaaaa cgagcgcttc 4740 
gctcactgtt cacgtcgcac ctatatctgc 4800 
gaacggcata gtgcgtgttt atgcttaaat 4860 
tgaaaggtag tctagtacct cctgtgatat 4920 
tccttcagca ctacccttta gctgttctat 4980 
atccttcaat gctatcattt cctttgatat 5040 
gacattaacc tataaaaata ggcgtatcac 5100 
tgacggtgaa aacctctgac acatgcagct 5160 
ggatgccggg agcagacaag cccgtcaggg 5220 
ctggcttaac tatgcggcat cagagcagat 5280 
attactatat atataatata ggaagcattt 5340 
tgcagttatg acgccagatg gcagtagtgg 5400 
accttacgta caatcttgat ccggagcttt 5460 
tcgaaaaaag aaaaggagag ggccaagagg 5520 
tatacgtgat taagcacaca aaggcagctt 5580 
gttctggtcc attggtgaaa gtttgcggct 5640 
tagattccga tgctgacttg ctgggtatta 5700 
acccggttat tgcaaggaaa atttcaagtc 5760 
ctccgaaata cttggttggc gtgtttcgta 5820 
tcaatgatta cggcattgat atcgtccaac 5880 
aagagttcct cggtttgcca gttattaaaa 5940 
tactcagtgc agcttcacag aaacctcatt 6000 
gtgggacagg tgaacttttg gattggaact 6060 
gccccgaaag cttacatttt atgttagctg 6120 
cgcttagatt aaatggcgtt attggtgttg 6180 
taaaagactc taacaaaata gcaaatttcg 6240 
agtagtattt atttaagtat tgtttgtgca 6300 
acagatgcgt aaggagaaaa taccgcatca 6360 
attcgcgtta aatttttgtt aaatcagctc 6420 
aatcccttat aaatcaaaag aatagaccga 6480 
caagagtcca ctattaaaga. acgtggactc 6540 
gggcgatggc ccactacgtg aaccatcacc 6600 
taaagcacta aatcggaacc ctaaagggag 6660 
ggcgaacgtg gcgagaaagg aagggaagaa 6720 
aagtgtagcg gtcacgctgc gcgtaaccac 6780 
gggcgcgtcg cgccattcgc cattcaggct 6840 
ggcctcttcg ctattacgcc agctggcgaa 6900 
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agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 69 6G 

TTnt'^:^::^^r'n ar^rrrrr^r'arft-r' rr1-r^/^aarfr^-t--t- -h/^rTr^rrarrr^i-r^ r^arra-hr-r^r-rtn /T/-t-t-t-r^r-=j=an ^09 0 

^ ^ ^W^^ — ^^^._.^_^^^V^ ^WWWV,>-,^W»^V-* .,^<_^ 

ttaaagcctt cgagcgtccc aaaaccttct caagcaaggt tttcagtata atgttacatg 7 080 
cgtacacgcg tctgtacaga aaaaaaagaa aaatttgaaa tataaataac gttcttaata 7140 
craacataac tataaaaaaa taaataggga cctagactrc aggttgtcta actccttcct 72 00 
tt tcggttag agcggatgrg gggggagggc gtgaatgtaa gcgtgacata actaattaca 72 60 
Lgaca- cgac aaaggaaaag gggcctgttt actcacaggc ttttttcaag taggtaatta 7 32 0 
agccgtt tct gtctttttcc tt ctccaacc caeca aaggc catcttggta cttttttttt 7380 
itc::zi:i:!:tt tttttttttt ttrtrttttt tttttttttt tttttttttt tttttttttt 7440 
titttttttt tttttttttt tttttcatag aaataataca gaagtagatg ttgaattaga 7500 
iiaaaccgaa gatatataat ttattggaaa atacatagag ctttttgttg atgcgcttaa 7560 
::rgat:caatt caacaacacc accagcagct ctgatttttt cttcagccaa cttggagacg 7620 
o.itccagctt tgacgataac tggaacattt ggaattctac ccttacccaa gatcttaccg 7 680 
*£;£iccgqctg ccaaagtgtc aataactgga gcagtttcct tagaagcaga tttcaagtat 7740 
I'zz'izzzzcz tgtcttctgg gatcaatgtc cacaatttgt ccaagttcaa gactggcttc 7 8 00 
:jgaaargag cttgttgctt gtggaagtat ctcataccaa nccttaccga aataacctgg 7860 
c^'^ggtattta tccatgttaa ttctgtggtg atgttgacca ccggccatac ctctaccacc 7 92 0 
q:ugg:,gcttt ctgtgcttac cgatacgacc tttaccggct gagacgtgac ctctgtgctt 7 98 0 
-izzaqzczza gtgaatctgg aaggcattct tgattagttg gatgattgtt ctgggattta 8040 
arqcaaaaaa atcactaaga aggaaaaaaa tcaacggaga aagcaaacgc catcttaaat 8100 
atacggoaca cagatgaaag gtttgaacct atctgggaaa atacgcatta aacaagcgaa 8160 
aaactqcgag gaaaattgtt tgcgtctctg cgggctattc acgcgccaga ggaaaatagg 822 0 
aaaaataaca gggcattaga aaaataattt tgattttggt aatgtgtggg t ccc tggtgt 828 0 
acagargtta ca ttggttac agtactcttg tttttgctgt gtttttcgat gaatctccaa 8340 
aatggttqtt agcacatgga agagtcaccg atgctaagtt atctctatgt aagctacgtg 8400 
ccgtgacttt ugatgaagcc gcacaagaga tacaggattg gcaactgcaa atagaatctg 8 4 60 
gggatctaga ratccttttg ttgtttccgg gtgtacaata tggacttcct cttttctggc 8520 
aaccaaaccc atacatcggg attcctataa taccttcgtt ggtctcccta acatgtaggt 8 580 
ggcggagggg agatatacaa tagaacagat accagacaag acataatggg ctaaacaaga 8 64 0 
ctacaccaat tacactgcct cattgatggt ggtacataac gaactaatac tgtagcccta 8700 
gacttgatag coat cat cat atcgaagttt cactaccctt tttccatttg ccatctattg 8 7 60 
aagtaataat aggcgcatgc aacttctttt cttttttttt cttttctctc tcccccgttg 8820 
r tgt c tea cc atatccgcaa tgacaaaaaa aatgatggaa gacactaaag gaaaaaatta 8 8 80 
acgacaaaga cagcaccaac agatgtcgtt gttccagagc tgatgagggg tatcttcgaa 8 94 0 
cacacgaaac tttttccttc cttcattcac gcacactact ctctaatgag caacggtata 9000 
cggccttcct tccagttact tgaatttgaa ataaaaaaag tt tgccgctt tgctatcaag 9060 
cacaaataga cctgcaatta ttaatctttt gtttcctcgt cattgttctc gttccctttc 9120 
ttccttgttt ctttttctgc acaatatttc aagctata cc aagcatacaa tcaactccaa 9180 
gcttgaagca agcctcctga aagatgaagc tactgtcttc tatcgaacaa gcatgcgata 9240 
tttgccgact taaaaagctc aagtgctcca aagaaaaa cc gaagtgcgcc aagtgtctga 9300 
agaacaactg ggagtgtcgc tactctccca aaaccaaaag gtctccgctg actagggcac 9 360 
atctgacaga agtggaatca aggctagaaa gactggaa ca gctatttcta ctgatttttc 9420 
ctcgagaaga cc ttgacatg attttgaaaa tggattcttt acaggatata aaagcattgt 94 8 0 
taacaggatt a tttgtacaa gataatgtga ataaagatgc cgtcacagar agattggctt 954 0 
cagtggagac tgatatgcct ctaacattga gacagcatag aataagtgcg acatcatcat 9600 
cggaagagag tagtaacaaa ggtcaaagac agttgactgt accgccggaa ttcatggctt 9660 
acccatacga tgttccagat tacgctagct tgggtggtca 97 00 

<210> 92 

<211> 9749 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
PGC1003 



<400> 92 
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agaatctggg 
tttctggcaa 
atgtaggtgg 
aaacaagact 
tagccctaga 
atctattgaa 
ccccgttgtt 
aaaaattaac 
tcttcgaaca 
acggtatacg 
ctatcaagta 
tccctttctt 
aactccaagc 
atgcgatatt 
gtgtctgaag 
tagggcacat 
gatttttcct 
agcattgtta 
attggcttca 
atcatcatcg 
catggcttac 
gtgctgtaaa 
ggaaaaagca 
gcatcaacat 
tcctcct eta 
ttcaacaaga 
aggatctggt 
taattatgct 
acaaaatatt 
tggagcaaga 
tgtttcggac 
agtcatcgtc 
tcctgttgtt 
cacattgtat 
cttgaatacc 
atggtgccac 
aaattctttg 
agcttatcga 
ggaaagtgta 
tttaaatgaa 
cgatggccct 
ccttcattgc 
agtaattgtc 
gccggagctc 
atttcttatg 
aattttaaag 
tgtaggtcag 
ccggcatgca 
gctcacaatt 
atgagtgagg 
cctgtcgtgc 
tgggcgctct 
agcggtatca 
aggaaagaac 
gctggcgttt 
tcagaggtgg 
cctcgtgcgc 
ttcgggaagc 



gatctagata 
ccaaacccat 
cggaggggag 
acaccaa tta 
cttgatagcc 
gtaataatag 
gtctcaccat 
gacaaagaca 
cacgaaactt 
gccttccttc 
taaatagacc 
ccttgtttct 
ttgaagcaag 
tgccgactta 
aacaactggg 
ctgacagaag 
cgagaagacc 
acaggattat 
gtggagactg 
gaagagagta 
ccatacgatg 
cgtggcaatt 
ggaggaatct 
ggaacattgc 
ccacattcaa 
aacgctggct 
ggaaaacgga 
acgttatatg 
gttgctgcgc 
ttaattgtac 
accttaacag 
atcggacaat 
gtatccttcc 
gctgatgaag 
aacatgtttg 
gttatgacat 
agtgctgcaa 
agaccatttg 
tggaaacagg 
aaaggaaatc 
gaagttgtgg 
tcatrgaaat 
tatcaaaagg 
tatgatgcag 
atttatgatt 
tgactcttag 
gttgctttct 
agcttggcgt 
ccacacaaca 
taactcacat 
cagctggatt 
tccgcttcct 
gctcactcaa 
atgtgagcaa 
ttccataggc 
cgaaacccga 
tctcctgttc 
gtggcgcttt 



tccttttgtt 
acatcgggat 
atatacaata 
cactgcctca 
atcatcatat 
gcgcatgcaa 
atccgcaatg 
gcaccaacag 
tttccttcct 
cagttacttg 
tgcaattatt 
ttttctgcac 
cctcctgaaa 
aaaagctcaa 
agtgtcgcta 
tggaatcaag 
ttgacatgat 
ttgtacaaga 
atatgcctct 
gtaacaaagg 
ttccagatta 
caaaaaagtc 
actattctga 
tcggagagaa 
caactcttcg 
cacgagctgc 
caatgcttag 
attatatgga 
agattgatag 
ctgaattggc 
atcaacctca 
gtgatgtttc 
gtcattgtgc 
gatccggatg 
tacaattcga 
attccttagc 
aaagagttca 
aattgagagt 
aagatggatc 
tttgcatttg 
aaatctcgga 
tccgaccaaa 
caagctcaac 
ccccggggcg 
tttattatta 
gttttaaaac 
caggtatagc 
aatcatggtc 
tacgagccgg 
taattgcgtt 
aatgaatcgg 
cgctcactga 
aggcggtaat 
aaggccagca 
tccgcccccc 
caggactata 
cgaccctgcc 
ctcatagctc 
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gt ttccgggt 
tcctataata 
gaacagatac 
ttgatggtgg 
cgaagrttca 
cttcttttct 
acaaaaaaaa 
atgtcgttgt 
tcattcacgc 
aatttgaaat 
aatcttttgt 
aatatttcaa 
gatgaagcta 
gtgctccaaa 
ctctcccaaa 
gctagaaaga 
tttgaaaatg 
taatgtgaat 
aacattgaga 
tcaaagacag 
cgctagcttg 
gaaaccattg 
gcctccaggt 
aatatcatcg 
atctggaaaa 
tttaattcaa 
aacttcaagc 
agacaagagt 
caatggagca 
agtggaagga 
cctcaaacca 
aatgtcagct 
ttcaactttt 
gcagaaggct 
gcaacctggg 
gaggttaatg 
tttagctgta 
ttattgtgtt 
tcgtctactt 
tattgaagat 
aacacaacac 
ggaaatcaac 
ggagccaatg 
tcgacctgca 
aataagttat 
gaaaattctt 
atgaggtcgc 
atagctgt tt 
aagcataaag 
gcgctcactg 
ccaacgcgcg 
ctcgctgcgc 
acggttatcc 
aaaggccagg 
tgacgagcat 
aagataccag 
gcttaccgga 
acgctgtagg 



gracaatatg 
ccttcgttgg 
cagacaagac 
tacataacga 
ctaccctttt 
ttttttttct 
tgatggaaga 
tccagagctg 
acactactct 
aaaaaaagtt 
ttcctcgtca 
gccataccaa 
ctgtcttcta 
gaaaaaccga 
accaaaaggt 
ctggaacagc 
gattctttac 
aaagatgccg 
cagcatagaa 
ttgactgtat 
ggtggtcata 
aaaccgcaga 
gttcgaagat 
tgttctcaat 
agtgcattca 
gaatgtagtt 
tcaaattgct 
gttcttggat 
cgattgtctt 
gagaaaatgc 
atcgaatccg 
catgataata 
ccacgtgaca 
gtaacaattg 
aagaagaatg 
ctagctggtc 
tttggt ccaa 
cctgaaactg 
tgtgaatcaa 
gttattcctg 
agatttgttg 
ggctcacagt 
gtgatggaag 
gccaagctaa 
aaaaaaaata 
gttcttgagt 
tcttattgac 
cctgtgtgaa 
tgtaaagcct 
cccgctttcc 
gggagaggcg 
tcggtcgttc 
acagaatcag 
aaccgtaaaa 
cacaaaaatc 
gcgtttcccc 
tacctgtccg 
ratctcagtt 



gact ucctct 
tctccctaac 
ataatgggct 
actaatactg 
tccat ttgcc 
tttctctctc 
cactaaagga 
atgaggggta 
ctaatgagca 
tgccgctttg 
ttgttctcgt 
gcatacaatc 
tcgaacaagc 
agtgcgccaa 
ctccgctgac 
tatttctact 
aggatataaa 
tcacagatag 
taagtgcgac 
cgccggaatt 
tggccatgga 
aaatgaattc 
tacttttgga 
attttgagcc 
gtggatattc 
cgagtagttc 
cggatgacga 
tagacacttc 
tatcaaaaag 
tttatctggc 
ctctctcccc 
ttctccgtcg 
attggcaatt 
gagaagagaa 
atggatttgg 
atccgaggag 
ctgaaatgtc 
gagcagcaat 
atgattttat 
gattctcatg 
ctcaaaatgg 
tttcaacgag 
tgagcaatga 
ttccgggcga 
agtgtataca 
aactctttcc 
cacacctcta 
attgttatcc 
ggggtgccta 
agtcgggaaa 
gtttgcgtat 
ggctgcggcg 
gggataacgc 
aggccgcgtt 
gacgctcaag 
ctggaagctc 
cctttctccc 
cggtgtaggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 
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cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt 3540 
atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc 3500 
agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa 3660 
gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa 3720 
gccagttacc ttcggaaaaa gagttgg-cag ctcttgatcc ggcaaacaaa ccaccgctgg 3780 
tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga 3840 
agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg 3900 
gatttcggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg 3960 
aagttttaaa tcaatctaaa gtatatatga gcaaacttgg tctgacagtt accaatgctt 4020 
aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact 4080 
ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat 4140 
gacaccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg 4200 
aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 4260 
trgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat 4320 
tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc 4380 
ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt 4440 
cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc 4500 
agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 4560 
gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc 4620 
gtcaatacgg gataacaccg cgccacatag cagaacttta aaagtgctca tcattggaaa 4680 
acgttctrcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta 4740 
acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg 4800 
agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 4860 
aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat 4920 
gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt 4980 
tccccgaaaa gtgccacctg aacgaagcat ctgtgcttca ttttgtagaa caaaaatgca 5040 
acgcgagagc gctaattttt caaacaaaga atctgagctg catttttaca gaacagaaat 5100 
gcaacgcgaa agcgctattt taccaacgaa gaatctgtgc ttcatttttg taaaacaaaa 5160 
atgcaacgcg agagcgctaa tttttcaaac aaagaatctg agctgcattt ttacagaaca 5220 
gaaatgcaac gcgagagcgc tattttacca acaaagaatc tatacttctt ttttgttcta 5280 
caaaaatgca tcccgagagc gctatttttc taacaaagca tcttagatta ctttttttct 5340 
cctttgtgcg ctctataatg cagtctcttg ataacttttt gcactgtagg tccgttaagg 5400 
ttagaagaag gctactttgg tgtctatttt ctcttccata aaaaaagcct gactccactt 5460 
cccgcgttta ctgattacta gcgaagctgc gggtgcattt tttcaagata aaggcatccc 5520 
cgattatatt ctataccgat gtggattgcg catactttgt gaacagaaag tgatagcgtt 5580 
gatgattctt cattggtcag aaaattatga acggtrtctt ctattttgtc tcratatact 5640 
acgtatagga aatgtttaca ttttcgtatt gttttcgatt cactctatga atagttctta 5700 
ctacaatttt tttgtctaaa gagraatact agagataaac ataaaaaatg tagaggtcga 5760 
gtttagatgc aagttcaagg agcgaaaggt ggatgggtag gttatatagg gatatagcac 5820 
agagatatat agcaaagaga tacttttgag caatgtttgt ggaagcggta ttcgcaatat 5880 
tttagtagct cgttacagtc cggtgcgttt ttggtttttt gaaagtgcgt cttcagagcg 5940 
cttttggttt tcaaaagcgc tctgaagttc ctatactttc tagagaatag gaacttcgga 6000 
ataggaactt caaagcgttt ccgaaaacga gcgcttccga aaatgcaacg cgagctgcgc 6060 
acatacagct cactgttcac gtcgcaccta tatctgcgtg ttgcctgtat atatatatac 6120 
atgagaagaa cggcatagtg cgtgtttatg cttaaatgcg tacttatatg cgtctattta 6180 
tgtaggatga aaggtagtct agtacctcct gtgatattat cccattccat gcggggtatc 6240 
gtatgcttcc ttcagcacta ccctttagct gttctatatg ctgccactcc tcaattggat 6300 
tagtctcatc cttcaatgct atcatttcct ttgatattgg atcatattaa gaaaccatta 6360 
ttatcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt ctcgcgcgtt 6420 
tcggtgatga cggtgaaaac ctctgacaca tgcagctccc ggagacggtc acagcttgtc 6480 
tgtaagcgga tgccgggagc agacaagccc gtcagggcgc gtcagcgggt gttggcgggt 6540 
gtcggggctg gcttaactat gcggcatcag agcagattgt actgagagtg caccatagat 6600 
caacgacatt actatatata taatatagga agcatttaat agacagcatc gtaatatatg 6660 
tgtactttgc agttatgacg ccagatggca gtagtggaag atattcttta ttgaaaaata 6720 
gcttgtcacc ttacgtacaa tcttgatccg gagcttttct ttttttgccg attaagaatt 6780 
aattcggtcg aaaaaagaaa aggagagggc caagagggag ggcattggtg actattgagc 6840 
acgtgagnat acgtgattaa gcacacaaag gcagcttgga gtatgtctgt tattaatttc 6900 
acaggtagtt ctggtccatt ggtgaaagtt tgcggcttgc agagcacaga ggccgcagaa 6960 
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tgtgctctag 
acaattgacc 
tcaggcactc 
gctctggtca 
gaataccaag 
aacatactac 
gaagcaggtg 
caagagagcc 
ggtgatgcgc 
aa tggtgtaa 
artactgagt 
ataccgcaca 
tgttaaaatt 
tcggcaaaat 
tttggaacaa 
tctatcaggg 
ggtgccgtaa 
gaaagccggc 
cgctggcaag 
cgctacaggg 
cggtgcgggc 
taagttgggt 
caagctttcg 
accttctcaa 
aaaagaaaaa 
atagggacct 
ggagggcgtg 
cctgtttact 
ttcaacccac 
tttttttttt 
ttcatagaaa 
t tggaaaata 
agcagctctg 
aacatttgga 
aactggagca 
caatgtccac 
gaagtatctc 
rgtggtgatg 
tacgaccttt 
gcattcttga 
aaaaaaatca 
tgaacctatc 
grctctgcgg 
araattttga 
actcttgttt 
grcaccgatg 
caagagatac 



atrccga tgc 
cggttattgc 
cgaaatactt 
atgat tacgg 
agttcctcgg 
tcagtgcagc 
ggacaggtga 
ccgaaagctt 
ttaga ttaaa 
aagactctaa 
agtatttatt 
gatgcgtaag 
cgcgttaaat 
cccttataaa 
gagtccacta 
cgatggccca 
agcactaaat 
gaacgtggcg 
tgtagcggtc 
cgcgtcgcgc 
ctcttcgcta 
aacgccaggg 
cgagctcgag 
gcaaggtttt 
tttgaaatat 
agacttcagg 
aatgtaagcg 
cacaggcttt 
caaaggccat 
tttttttttt 
taatacagaa 
catagagctt 
attttttctt 
a t tctaccct 
gtttccttag 
aatttgtcca 
ataccaancc 
ttgaccaccg 
accggctgag 
ttagttggat 
acggagaaag 
tgggaaaata 
gctattcacg 
ttttggtaat 
ttgctgtgtt 
ctaagttatc 
aggattggca 



tgacttgctg 
aaggaaaatt 
ggttggcgtg 
cattgatatc 
tttgccagtt 
ttcacagaaa 
acttttggat 
acattttatg 
tggcgttatt 
caaaatagca 
taagtattgt 
gagaaaatac 
ttttgttaaa 
tcaaaagaat 
ttaaagaacg 
ctacgtgaac 
cggaacccta 
agaaaggaag 
acgctgcgcg 
cattcgccat 
ttacgccagc 
ttttcccagt 
atcccgagct 
cagtataatg 
aaataacgtt 
ttgtctaact 
tgacataact 
tttcaagtag 
cttggtactt 
tttttttttt 
gtagatgttg 
tttgttgatg 
cagccaactt 
tacccaagat 
aagcagattt 
agttcaagac 
ttaccgaaat 
gccatacctc 
acgtgacctc 
gattgttctg 
caaacgccat 
cgcattaaac 
cgccagagga 
gtgtgggtcc 
tttcgatgaa 
tctatgtaag 
actgcaaat 



134 
ggtattarat 
tcaagtcttg 
tttcgtaatc 
gtccaaccgc 
attaaaagac 
cctcattcgt 
tggaactcga 
ttagctggtg 
ggtgttgatg 
aatttcgtca 
ttgtgcactt 
cgcatcagga 
tcagctcatt 
agaccgagat 
tggactccaa 
catcacccta 
aagggagccc 
ggaagaaagc 
taaccaccac 
tcaggctgcg 
tggcgaaagg 
cacgacgttg 
ttgcaaatta 
ttacatgcgt 
cttaatacta 
ccttcctttt 
aattacatga 
gtaattaagt 
tttttttttt 
tttttttttt 
aattagatta 
cgcttaagcg 
ggagacgaat 
cttaccgtaa 
caagtattgg 
tggcttccag 
aacctggatg 
taccaccggg 
tgtgctttct 
ggatttaatg 
cttaaatata 
aagcgaaaaa 
aaataggaaa 
ctggtgtaca 
tctccaaaat 
ctacgtggcg 



gtgtgcccaa 
taaaagcata 
aacctaagga 
atggagatga 
tcgtatttcc 
ttactccctt 
tttctgactg 
gactgacgcc 
taagcggagg 
aaaatgctaa 
gccgatctat 
aattgtaaac 
ttttaaccaa 
agggttgagt 
cgtcaaaggg 
atcaagtttt 
ccgatttaga 
gaaaggagcg 
acccgccgcg 
caactgttgg 
gggatgtgct 
taaaacgacg 
aagccttcga 
acacgcgtct 
acataactat 
cggttagagc 
tatcgacaaa 
cgtttctgtc 
tttttttttt 
tttttttttt 
aactgaagat 
atcaattcaa 
ctagctttga 
ccggctgcca 
tctctcttgt 
aaatgagctt 
gtatttatcc 
gtgctttctg 
agtcttagtg 
caaaaaaatc 
cgggatacag 
ctgcgaggaa 
aataacaggg 
gatgttacat 
ggttgttagc 
tgacttttga 



tagaaagaga 
taaaaatagt 
ggatgttttg 
gtcgtggcaa 
aaaagactgc 
gtttgattca 
ggttggaagg 
agaaaatgtt 
tgtggagaca 
gaaataggtt 
gcggtgtgaa 
gttaatattt 
taggccgaaa 
gttgttccag 
cgaaaaaccg 
ttggggtcga 
gcttgacggg 
ggcgctaggg 
cttaatgcgc 
gaagggcgat 
gcaaggcgat 
gccagtcgtc 
gcgtcccaaa 
gtacagaaaa 
aaaaaaataa 
ggatgtgggg 
ggaaaagggg 
tttttccttc 
tttttttttt 
tttttttttt 
atataattta 
caacaccacc 
cgataactgg 
aagtgtcaat 
cttctgggat 
gttgcttgtg 
atgttaattc 
tgcttaccga 
aatctggaag 
actaagaagg 
atgaaaggtt 
aattgtttgc 
cattagaaaa 
tggttacagt 
acatggaaga 
tgaagccgca 



702C 
7080 
7140 
720: 
7260 
7320 
7380 
7440 
7 500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9749 



<210> 93 
<211> 1415 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 93 

Met lie Leu Arg His Phe Gly Phe lie Leu lie 
15 10 



lie Leu Thr Val Tyr 
15 



Leu Ser Ser Thr His Ala Thr Thr Arg Lys His His Arg Arg Ser lie 
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20 25 30 

Asp Asn Asp Ala Arg Asn Leu Gly Phe Gin Phe Val Met Giu Pro Arg 

35 40 45 

Arg Asn Val Thr Val Met Glu Ser Ser Ser Hxs Leu Leu Glu Cys Ser 
50 55 60 

Tyr Val Leu Ala His Glu Arg Leu Val His Asp Val Arg lie Glu Trp 
65 70 75 80 

Lys Arg Asp Gly Val Leu Leu Ser Glu Arg Thr Ser Ser Arg lie Lys 

85 90 95 

Val Met Ser Asn Gly Ser Leu Trp lie Glu Ser Val Ser Ser Ala Glu 
100 105 110 

Glu Gly Thr Tyr Gin Cys Ala Val His Val Thr Thr Lys Ser Asp Gin 
115 120 125 

Thr Ser Asp Thr Trp Thr Phe Leu Ser Arg Lys Ala Thr Leu Arg Leu 
130 135 140 

Ala Asp Leu Ala Lys Phe Glu Leu Gin Ala lie Asp Arg Thr Leu Ala 
145 150 155 160 

Lys Gly Gin Pro Thr Ala Phe His Cys Leu lie Asn Ser Lys Pro Thr 
165 170 175 

Pro Thr Ala Val Trp Leu His Asn Asp Glu Pro lie Val Asn Gly Gly 
180 185 190 

Glu Tyr His lie Leu Pro Val Ser Asn Thr Leu Glu lie Ser Ser Thr 
195 200 205 

Gin Ser Arg His Glu Gly Thr Tyr Arg Cys Thr Val Glu Gly Ala Gly 
210 215 220 

Lys Arg Arg Ser Ser Gin Thr Ala Arg Leu Thr Val Thr Thr Glu Thr 
225 230 235 240 

Val Ser Asn Glu Leu Val Phe lie Thr Thr Pro Arg Leu Gin Val Val 
245 250 255 

Glu Gin Gly Asp Glu Phe Leu Leu Glu Cys Leu Val Ala Ser Leu lie 
260 265 270 

Arg Pro Gin Val Arg Tro Leu Lys Asp Ser Arg Gin lie lie Val Asp 
275 280 285 

Gly Val Arg lie Arg Arg Val Gly Val Ser Ser lie Leu Val Ser Arg 
290 295 300 

Ala Ser He Glu Asp Thr Gly Leu Tyr Thr Cys Arg Ala Ser Asn Asn 
305 310 315 320 

Asp Asp Ser He Asp Arg Ala Val Ser Val Glu Val Arg Ala Pro Pro 
325 330 335 
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Arg lie Thr Thr Arg Pro Thr Thr Lys Val Ala Val Glu Thr Ala Asp 
340 345 350 

Val Glu Leu Glu Cys Gly Thr Ala Ala Ala Arg Pro Glu Ala Arg Val 
355 360 365 

Asn Trp Tyr Lys Asn Gly Glu Ala lie He Gly Ser Glu Tyr Phe Val 
370 375 380 

He Glu Pro Asn Arg Leu Arg He Leu Gly Val Val Arg Ala Asp Gin 
385 390 395 400 

Ala He Tyr Gin Cys He Ala Glu Asn Asp Val Gly Ser Glu Gin Ala 

405 410 415 

Ser Ala Gin Leu Leu Val Asp Ala Pro Asp Ser Ser Ser Val Ala Ala 
420 425 430 

Ser Ser Gly Val Pro Met Thr Ser Ser Ala Pro Leu Gly Leu Arg Ser 
435 440 445 

Thr Ser Ser Gly Ser Arg Phe He Asn Val Glu Trp Asp Pro Pro Val 
450 455 460 

Gin Arg Asn Gly Asn He Met Arg Tyr His He Phe Tyr Lys Asp Asn 
465 470 475 480 

Leu He Asp Arg Glu Arg Met He Asn Ser Ser Ser Thr Ser Ala Thr 
485 490 495 

Leu Thr Ser Leu Gin Pro Ser Thr Met Tyr Leu He Arg Val Thr Ala 
500 505 510 

Glu Asn Glu Ala Gly Met Gly Lys Phe Ser Asp Ser Leu Lys Val Thr 
515 520 525 

Thr Asn Lys Glu Gin Ala Val Pro Gly Lys Val Ala Ser Leu Thr Thr 
530 535 540 

Thr Ala Thr Gly Pro Glu Thr He Asp He Arg Trp Ser Pro Pro Ser 
545 550 555 560 

Gly Gly Gin Pro Ala Leu Arg Tyr Lys He Phe Tyr Ser His Asp Pro 
565 570 575 

Leu Glu Lys Asn Glu Lys Glu Thr Leu He Thr Thr Ser Thr Thr His 
530 585 590 

Tyr Thr Leu His Gly Met Asp Lys Tyr Thr Gly Tyr Gin He Arg He 
595 600 605 

Glu Ala Glu Gly Ser Asn Gly Ser Gly Leu Ser Ser Asp Thr Val Lys 
610 615 620 

Val Arg Thr Gin Ser Asp Glu Pro Ser Ala Pro Pro Val Asn He Gin 
625 630 635 640 
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Ala Glu Ala Asp Ser Ser Thr Ser Val Arg Val Ser Trp Asp Glu Pro 

645 650 655 

Glu Glu Glu Ser Val Asn Gly Glu lie Thr Gly Tyr Arg Leu Lys Tyr 
660 665 670 

Lys Thr Lys Ala Arg Gly Ala Lys Gly Asn Thr Leu Val lie Asp Ala 
675 680 685 

Thr Ala Arg Glu Tyr Thr Met Gly Asn Leu Glu Pro Asn Thr Gin Tyr 
690 695 700 

Leu lie Arg Met Ala Val Val Asn His Asn Gly Thr Gly Pro Phe Ser 
705 710 715 720 

Asp Trp Val Ser lie Asp Thr Pro Gly Gin Asp Lys Glu Glu Arg Thr 

725 730 735 

Leu Gly Ala Pro Arg Glu lie Arg Pro His Ala Gly lie Asp Tyr lie 
740 745 750 

Leu Val Ser Trp Leu Pro Pro Ala Asp Glu Gin Asn Leu Val Arg Gly 
755 760 765 

Tyr Gin lie Gly Trp Gly Leu Ser Val Pro Asp Thr Glu Thr lie Arg 
770 775 780 

Val Thr Ala Ser Thr Thr Gin Tyr Lys lie Ala Arg Leu His Ser Glu 
785 790 795 800 

Arg Asp Tyr Val lie Ser Leu Arg Ala Phe Asn Asn Leu Gly Ser Gly 

805 810 815 

Phe Pro lie Tyr Glu Thr Val Arg Thr Leu Ser Arg Glu Thr Pro Ser 
820 825 830 

His Phe Asn Glu Asp Ser Asp Ser Asp Asp Ser Asp Val Gly Ser Ser 
835 840 845 

Glu Ser Thr Pro Val Gly Val Arg Ala Glu Ala lie Ser Ala Thr Ser 
850 855 860 

lie Arg Val Met Trp Thr Glu Ser Asp Glu Thr Ala Phe Asn Thr Gin 
865 870 875 880 

Tyr Thr Val Arg Tyr Ser Thr Ala Val Asp Gly Asn Gin His Arg Tyr 
885 890 895 



Val Asn Ser Thr Glu Thr Trp Ala 
900 



Ser Thr Trp Ser Met Ala Thr Arg 
930 935 

Ser Ser Ala Pro Arg Asp Leu Thr 



Thr Val Glu Gly Leu Arg Pro Ala 
905 910 

Ser Asn Gly Gin Leu 
925 

Asn Arg Thr Leu Ala Ala Pro Pro 
940 

Val Leu Pro Ala Glu Ser Gly Asp 



Thr Glu Tyr Glu Phe Ala Val Arg Ala Val Ala 
915 920 



wo 00/73328 



PCT/E POO/05 108 



138 

945 950 955 960 

Pro His Ser Ser Ser Leu His Trp Gin Pro Pro Lvs Tyr Ser Asn Giy 

965 970 ' 975 

Glu lie Giu Glu Tyr Leu Val Phe Tyr Thr Asp Arg Ala Ser Leu Ala 
980 985 990 

Asp Lys Asp Trp Thr lie Asn Tyr Val Ala Gly Asp Lys Leu Ser His 
995 1000 1005 

Gin Val Ser Asn Leu Leu Pro Lys Ala Asn Tyr Phe Phe Lys lie Gin 
1010 1015 1020 

Ala Arg Asn Glu Lys Gly His Gly Pro Phe Ser Ser Val Val Gly Tyr 
1025 1030 1035 1040 

Thr Pro Ser Gly Gly Ala lie Leu Ser Gly Lys Asp Arg His Asn Ala 
1045 1050 ^ 1055 

Arg Gly His Gly Ser Ala Ala Ser Gly Asp Thr Val Ser Leu Val Asp 
1060 1065 1070 

Gin Leu Gin Ser Leu Leu His Ser Asn Pro Leu Tyr Leu lie Leu Leu 
1075 1080 1085 

Ala Ala Phe Ala Leu lie Leu lie Leu Thr Leu lie Leu lie lie Met 
1090 1095 1100 

Cys Cys Trp Lys Arg Ser Ser Gly Gly Gly Arg Lys Asn Gly Tyr Gin 
1105 1110 1115 1120 

Ser Gly Lys Lys Thr Ser Ala Gly Ala Gly Ser Gly Gly Gly lie Giy 
1125 1130 1135 

Gly Leu Gly Gly Pro Pro Asn Asp Leu Trp lie Asn Gly Thr Gly Ser 
1140 1145 1150 

His Met Arg Ala Gly Ala Ser Asp Tyr Met Val Asp Gly Leu Ala Thr 
1155 1160 " 1165 

Ala His Leu Thr Ala Ala Asp lie Glu Ser Pro Thr Pro Arg Tyr His 
1170 1175 1180 

His Leu Gin Gly Gin Gly Thr Leu Thr Arg Ser Tyr His Gin Ser Ser 
1185 1190 1195 1200 

Gin Ser Leu Glu Gly Arg Gin Arg Thr Pro Gin Val Val Tyr Thr Gly 
1205 1210 1215 

Thr Gly Arg His Gin Pro lie Gin Arg lie Asp Phe Glu Ser Pro Tyr 
1220 1225 1230 

Gly Ser Ser Ser Ala lie Gly Ser Ala Ser Thr Pro Pro Leu Pro Met 
1235 1240 1245 

Gin Ala Pro Pro Ser Gly Pro Pro Thr Val lie Asp Gly Tyr Arg Thr 
1250 1255 1260 
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Leu Ary Gly Tax Pro Pro Asn Ser Ser Ala Ala Asa Ala Leu Arg Ser 
1265 1270 1275 1280 

Phe Thr Gin Leu Ala Gly Ala Thr Pro Pro Pro Pro His Ser Ala Ala 
1285 1290 1295 

Ser Ser Ser Ser Arg Pro Thr lie lie Ala Ala Gly Gly Arg Gin Val 
1300 1305 ^1310 

Pro Val Gly Arg Ala Thr Ala Gin Pro Arg Val Asn Val Ala Asn lie 
1315 1320 1325 

Tyr Ser Pro Phe Ala Ser Cys Ser Ala Ser Ser Asp Ala Gly Glu Ser 
1330 1335 1340 

Asp Lys Lys Ser Gly Glu Cys Met Glu Met Arg Glu Thr Thr Pro lie 
1345 1350 1355 1360 

Lys Ser Asn Thr Ala Gly Ser Ser Asn Gly Glu Lys Met Asn Thr Asn 
1365 1370 1375 

Met Asn Pro Ser His Ser Ala Glu Asp Leu Asn Ala His Leu Glu Asn 
1380 1385 1390 

Leu Asp Thr Met Leu Asp Asp Leu Gin Gin Leu Gin His Asn Leu His 
1395 1400 1405 



Phe Glu Thr Ser Met Asp Lys 
1410 1415 



<210> 94 
<211> 4248 
<212> DNA 

<213> Caenorhabdi tis elegans 



<400> 94 

atgattttgc 

catgccacaa 

tttcaattcg 

ttagaatgta 

aaacgtgatg 

ggttcattat 

catgtgacta 

acgttacgat 

aaagggcagc 

tggcttcaca 

aacactttgg 

gaaggtgctg 

gtatcgaatg 

gaattcttac 

gattctcgtc 

cttgtctcac 

gatgattcaa 

cgacctacca 

gctgctagac 

gagtattttg 



gacatttcgg 
cacggaaaca 
taatggaacc 
gttatgttct 
gtgtgttgct 
ggatagaatc 
caaaaagcga 
tggcggattt 
caactgcgtt 
acgatgaacc 
agattagttc 
ggaagagaag 
agctagtttt 
ttgaatgcct 
aaattattgt 
gtgcttctat 
ttgatcgagc 
cgaaagttgc 
cagaggcacg 
taatrgagcc 



tttcattctt 
tcatcgtcga 
acgacggaat 
ggctcatgag 
cagtgaacgg 
agtgagcagt 
tcaaacaagt 
ggcaaagttt 
tcattgtctt 
tattgtaaat 
cactcaatct 
aagtagccaa 
catcacaacg 
tgttgcctca 
cgatggagtt 
cgaagatact 
tgtatctgta 
tgttgaaact 
cgttaattgg 
gaacagactc 



ataattctga 
tcgattgata 
gtgactgtta 
agacttgtac 
acatcttcta 
gccgaagaag 
gatacgtgga 
gaactgcaag 
attaactcga 
ggtggcgaat 
agacacgaag 
actgcacgat 
ccacgtctac 
ctaatcagac 
cgtattcgac 
gggctataca 
gaagttcgtg 
gctgatgttg 
tacaaaaatg 
cgcatccttg 



cagtttatct 
atgatgctcg 
tggaatcatc 
acgatgtccg 
gaataaaagt 
gaacatatca 
catttttgag 
cgattgatcg 
aaccaacccc 
atcacattct 
ggacgtatag 
tgaccgttac 
aagttgttga 
ctcaagttag 
gtgttggcgt 
cttgtcgagc 
ctccgccgag 
aattagagtg 
gagaagcaat 
gagtcgtacg 



gtcatctact 
gaatcttggc 
gtctcattta 
aat tgaatgg 
aatgtcaaat 
atgcgctgta 
tcgaaaagct 
aactctagca 
aacagcagtt 
accagtgtca 
gtgtacggta 
tactgaaaca 
acaaggcgac 
atggctcaaa 
atcgtcaatt 
ttcgaataat 
aatcactaca 
tggaacggcg 
aattgggagc 
agccgatcaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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gcaatctacc aatgtattgc agaaaatgat 
cttgttgatg ctccagattc atcatcagtg 
tctgcaccgt taggcctccg ttcgacgtct 
gatccaccag ttcaaagaaa tgggaatatt 
ttaatcgaca gagaaaggat gatcaattca 
caaccatcaa caatgtattt aattcgagtt 
Ltcaqngatt cacrtaaagt taccacaaac 
Lcactaacaa caactgcaac tggccccgaa 
c^aggacaac ctgcccttcg atacaaaata 
aaaaaagaaa cattgattac gacctcgaca 
tacaccggat atcagattcg aattgaagct 
cacactgtga aagtacgaac acaatccgat 
acagaagctg atagctcaac aagtgttcga 
actaacggag aaattactgg atatcggttg 
zacaazaczc ttgtgattga cgccactgct 
aat.acacagt atctgataag aatggcagta 
lazzgqgztt ctattgatac tcctggacaa 
a::agagartc gaccacatgc tggaattgat 
aa::gaacaga atcttgttag aggatatcaa 
qaaacaattc gtgtaactgc ttccacaact 
cgagaitatg tgatttctct tcgagcattt 
gaQaczgzcc gaacattgtc tcgtgaaact 
qacgar-tctg acgttggatc ctcagaatct 
tctgcaacat caatacgtgt tatgtggacg 
tacacagttc gatatagtac agctgttgat 
gagacarggg ctactgtgga aggactacga 
gcrgttgcat caaatgggca gttgagtact 
gcggctccac caagttcagc tccacgtgat 
cczcaczctt caagtcttca ttggcaacct 
tatttagcgt tttatacgga tcgagcttct 
gtagcrggcg ataagttgtc acatcaagtc 
ttcaagattc aagcaagaaa tgagaaaggt 
actccarcag gcggtgctat tctatcagga 
tcagcggcrt ctggcgatac agtttcctta 
aacccattgt atctcatact tttggctgcg 
cttatcacaa tgtgttgttg gaaacgatct 
tctggaaaaa agacgagtgc cggagctgga 
ccacctaacg atttatggat taatggaact 
tatatggttg atggtctagc aaccgcccat 
cctagatacc atcatctaca agggcaagga 
caaagtcttg aaggacgaca acgaacacct 
cagccgattc aaagaatcga ctttgaaagt 
gcctcgactc cgcctctccc aatgcaagct 
ggataccgta ctctacgtgg aacacctcca 
ttcactcaat tggcaggagc aacaccacct 
cgaccgacta ttattgcagc cggtggacgg 
ccacgtgeca atgttgcaaa tatctacagt 
gcaggagaat cagataaaaa gagtggagaa 
aaatctaata cagctggttc atcgaatgga 
cactcggcag aagatctcaa cgcacatctt 
caacaattac agcacaattt gcattttgag 
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gttggaagtg aacaagctag tgcacaactt 1260 
gctgcttcct caggagttcc aatgacttcg 1320 
tctggttctc gctttataaa tgttgaatgg 1380 
atgagatatc atatattcta caaagataat 1440 
tcaagtacat ctgcaacgtt aacatctcta 1500 
actgctgaaa atgaagctgg aatgggaaag 1560 
aaagaacaag cagttcctgg aaaagtcgca 1620 
acgattgaca tccgttggag cccaccaagt 1680 
ttctattcgc atgatccati: ggagaaaaat 1740 
acacattata cacttcatgg aatggataaa 1800 
gaaggatcca atggatcggg gcttccaagt 1860 
gaaccatcag caccgcctgt aaacattcaa 1920 
gttagttggg atgagccaga agaggaatct 1980 
aagtacaaga caaaagctcg tggagccaaa 2040 
agagagtata ctatgggaaa tttggaaccg 2100 
gtgaatcaca atggaacggg acccttctcg 2160 
gataaagaag aacgaacact tggagcaccg 2220 
tatattcttg tatcatggct tccaccagca 2280 
attggatggg gattgagtgt tcctgatact 2340 
caatataaaa ttgccagatt gcattcggaa 2400 
aataatctag gatctggatt tccgatttat 2460 
ccatcacatt tcaatgaaga ttctgattct 2520 
actccagttg gagtgcgtgc tgaggcaatc 2580 
gaatctgatg aaactgcttt caatacacag 2640 
gggaatcaac atagatacgt gaatagcacc 2700 
ccagctactg agtatgagtt tgcggttaga 2760 
tggtcaatgg ctaccaggaa tcgaacatta 2820 
ctcactgtgc tgcctgcaga aagtggcgat 2880 
ccgaaatatt caaatggaga aattgaagaa 2940 
cttgcggaca aagattggac aattaactat 3000 
agcaatttac ttccaaaagc taattatttc 3060 
catggcccat tctcttctgt agttggatac 3120 
aaagatcgtc acaatgcacg tgggcatgga 3180 
gtcgatcaac ttcaatctct tcttcactct 3240 
tttgctctga ttctgatcct tacactgatt 3300 
agtggtggag ggcgaaagaa tggatatcag 3360 
agtggaggtg gaataggtgg tcttggaggt 3420 
ggaagtcata tgagagctgg tgcttctgat 3480 
cttactgctg ctgacattga atcacccaca 3540 
actctaacac gaagttatca tcaaagctca 3600 
caagtggttt atacaggaac tggaaggcat 3660 
ccttatggtt catcatctgc aattggatcc 3720 
ccaccttctg gtccacctac agtaatcgat 3780 
aattcatcag ctgcaaatgc tcttcggtca 3840 
ccaccacact ctgcggcatc atcatcttct 3900 
caagttccag ttggaagagc aactgctcaa 3960 
ccatttgcat catgctctgc atcttctgat 4020 
tgtatggaaa tgagagaaac cacacccatt 4080 
gagaagatga acacaaatat gaatccttca 4140 
gagaaccttg acacaatgct tgatgatcta 4200 
acgagtatgg ataagtaa 4248 
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Met Glu Asn Ser Leu Arg Cys Vai Trp Val Pro Lys Leu Ala Phe Val 



Leu Phe Gly Ala Ser Leu Leu Ser Ala His Leu Gin Val Thr Gly Phe 
20 25 30 

Gin lie Lys Ala Phe Thr Ala Leu Arg Phe Leu Ser Glu Pro Ser Asp 
35 40 45 

Ala Val Thr Met Arg Gly Gly Asn Val Leu Leu Asp Cys Ser Ala Glu 
50 55 60 

Ser Asp Arg Gly Val Pro Val lie Lys Trp Lys Lys Asp Gly lie His 
65 70 ' 75 80 

Leu Ala Leu Gly Met Asp Glu Arg Lys Gin Gin Leu Ser Asn Gly Ser 

85 90 95 

Leu Leu lie Gin Asn lie Leu His Ser Arg His His Lys Pro Asp Glu 
100 105 110 

Gly Leu Tyr Gin Cys Glu Ala Ser Leu Gly Asp Ser Gly Ser lie lie 
115 120 125 

Ser Arg Thr Ala Lys Val Ala Val Ala Gly Pro Leu Arg Phe Leu Ser 
130 135 140 

Gin Thr Glu Ser Val Thr Ala Phe Met Gly Asp Thr Val Leu Leu Lys 
145 150 155 160 

Cys Glu Val lie Gly Glu Pro Met Pro Thr lie His Trp Gin Lys Asn 
165 170 175 

Gin Gin Asp Leu Thr Pro lie Pro Gly Asp Ser Arg Val Val Val Leu 
180 185 190 

Pro Ser Gly Ala Leu Gin lie Ser Arg Leu Gin Pro Gly Asp lie Gly 
195 200 205 

lie Tyr Arg Cys Ser Ala Arg Asn Pro Ala Ser Ser Arg Thr Gly Asn 
210 215 220 

Glu Ala Glu Val Arg lie Leu Ser Asp Pro Gly Leu His Arg Gin Leu 
225 230 235 240 

Tyr Phe Leu Gin Arg Pro Ser Asn Val Val Ala lie Glu Gly Lys Asp 
245 250 _ 255 

Ala Val Leu Glu Cys Cys Val Ser Gly Tyr Pro Pro Pro Ser Phe Thr 
260 265 270 

Trp Leu Arg Gly Glu Glu Val lie Gin Leu Arg Ser Lys Lys Tyr Ser 
275 280 285 

Leu Leu Gly Gly Ser Asn Leu Leu lie Ser Asn Val Thr Asp Asp Asp 
290 295 300 



Ser Gly Met Tyr Thr Cys Val Val Thr Tyr Lys Asn Glu Asn lie Ser 
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305 310 315 320 

Ala Ser Ala Glu Leu Thr Val Leu Val Pro Pro Trp Phe Leu Asn His 

325 330 335 

Pro Ser Asn Leu Tyr Ala Tyr Glu Ser Met Asp lie Glu Phe Glu Cys 
340 345 350 

Tnr Val Ser Gly Lys Pro Val Pro Thr Val Asn Trp Met Lvs Asn Gly 
355 360 365 

Asp Val Val He Pro Ser Asp Tyr Phe Gin He Val Gly Gly Ser Asn 
3*70 375 380 

Leu Arg lie Leu Gly Val Val Lys Ser Asp Giu Gly Phe Tyr Gin Cys 
385 390 395 400 

Val Ala Glu Asn Glu Ala Gly Asn Ala Gin Thr Ser Ala Gin Leu He 
405 410 415 

Val Pro Lys Pro Aia He Pro Ser Ser Ser Vai Leu Pro Ser Ala Pro 
420 425 430 

Arg Asp Val Vai Pro Vai Leu Vai Ser Ser Arg Phe Vai Arg Leu Ser 
435 440 445 

Trp Arg Pro Pro Aia Glu Ala Lys Gly Asn He Gin Thr Phe Thr Val 
450 455 460 

Phe Phe Ser Arg Glu Gly Asp Asn Arg Giu Arg Aia Leu Asn Thr Thr 
465 470 475 480 

Gin Pro Gly Ser Leu Gin Leu Thr Vai Gly Asn Leu Lys Pro Glu Aia 
485 490 495 

Met Tyr Thr Phe Arg Vai Vai Aia Tyr Asn Glu Trp Gly Pro Gly Giu 
500 505 510 

Ser Ser Gin Pro He Lys Val Aia Thr Gin Pro Giu Leu Gin Vai Pro 
515 520 525 

Gly Pro Vai Giu Asn Leu Gin Ala Vai Ser Thr Ser Pro Thr Ser He 
530 535 540 

Leu He Thr Trp Glu Pro Pro Aia Tyr Aia Asn Gly Pro Val Gin Gly 

550 555 560 

Tyr Arg Leu Phe Cys Thr Giu Vai Ser Thr Gly Lys Giu Gin Asn He 
565 570 575 

Glu Val Asp Gly Leu Ser Tyr Lys Leu Giu Gly Leu Lys Lys Phe Thr 
580 585 590 

Glu Tyr Ser Leu Arg Phe Leu Aia Tyr Asn Arg Tyr Gly Pro Gly Val 
595 600 605 

Ser Thr Asp Asp He Thr Val Val Thr Leu Ser Asp Val Pro Ser Ala 
610 615 620 
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? L O ? i." O G 1 u As n Vci 1 3€; X L6: U G i u Vs 1 1 AS H 361' Ar 9' 3 6; r lie Ly 3 

625 630 635 640 

Vai Ser Trp Leu Pro Pro Pro Ser Gly Thr Gin Asn Gly Phe lie Thr 

645 650 655 

Glv Tvr Lys lie Arg His Arg Lys Thr Thr Arg Arg Gly Giu Met Glu 
660 665 670 

Tnr Leu Glu Pro Asn Asn Leu Trp Tyr Leu Phe Thr Gly Leu Glu Lys 
675 680 685 

Gly Ser Gin Tyr Ser Phe Gin Vai Ser Ala Met Thr Val Asn Gly Thr 
690 695 700 

Glv rro Pro Ser Asn Trp Tyr Thr Ala Glu Thr Pro Glu Asn Asp Leu 
■^Cl 710 715 720 

Asi. G^u Ser Gin Val Pro Asp Gin Pro Ser Ser Leu His Val Arg Pro 

725 730 735 

Gin Tnr Asn Cys lie lie Met Ser Trp Thr Pro Pro Leu Asn Pro Asn 
740 745 750 

lie Val Val Arg Gly Tyr lie lie Gly Tyr Gly Val Gly Ser Pro Tyr 
755 760 765 

Ala Giu Thr Vai Arg Vai Asp Ser Lys Gin Arg Tyr Tyr Ser lie Glu 
770 775 780 

Arg Leu Giu Ser Ser Ser His Tyr Vai lie Ser Leu Lys Ala Phe Asn 
785 790 795 800 

Asn Ala Gly Giu Gly Vai Pro Leu Tyr Glu Ser Ala Thr Thr Arg Ser 

805 810 815 

lie Thr Asp Pro Thr Asp Pro Val Asp Tyr Tyr Pro Leu Leu Asp Asp 
820 ' 825 830 

Phe Pro Thr Ser Vai Pro Asp Leu Ser Thr Pro Met Leu Pro Pro Val 
835 840 845 

Gly Vai Gin Ala Val Ala Leu Thr His Asp Ala Val Arg Vai Ser Trp 
850 855 860 

Ala Asp Asn Ser Val Pro Lys Asn Gin Lys Thr Ser Giu Val Arg Leu 
865 870 875 880 

Tyr Thr Vai Arg Trp Arg Thr Ser Phe Ser Ala Ser Ala Lys Tyr Lys 

885 890 895 

Ser Giu Asp Thr Thr Ser Leu Ser Tyr Thr Ala Thr Gly Leu Lys Pro 
900 905 910 

Asn Thr Met Tyr Glu Phe Ser Val Met Val Thr Lys Asn Arg Arg Ser 
915 920 925 
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Ser Thr Trp Ser Met Thr Ala His Ala Thr Thr Tyr Glu Ala Ala Pro 
930 935 940 

Thr Ser Ala Pro Lys Asp Phe Thr Val lie Thr Arg Glu Gly Lys Pro 
945 950 955 ' 960 

Arg Ala Val lie Val Ser Trp Gin Pro Pro Leu Glu Ala Asn Gly Lys 

965 970 975 

lie Thr Ala Tyr lie Leu Phe Tyr Thr Leu Asp Lys Asn He Pro He 
980 985 990 

Asp Asp Trp He Met Glu Thr lie Ser Gly Asp Arg Leu Thr His Gin 
995 1000 1005 

He Met Asp Leu Asn Leu Asp Thr Met Tyr Tyr Phe Arg He Gin Ala 
1010 1015 1020 

Arg Asn Ser Lys Gly Val Gly Pro Leu Ser Asp Pro He Leu Phe Arg 
1025 1030 1035 1040 

Thr Leu Lys Val Glu His Pro Asp Lys Met Ala Asn Asp Gin Gly Arg 
1045 1050 1055 

His Gly Asp Gly Gly Tyr Trp Pro Val Asp Thr Asn Leu He Asp Arg 
1060 1065 1070 

Ser Thr Leu Asn Glu Pro Pro He Gly Gin Met His Pro Pro His Gly 
1075 1080 1085 

Ser Val Thr Pro Gin Lys Asn Ser Asn Leu Leu Val He He Val Val 
1090 1095 1100 

Thr Val Gly Val He Thr Val Leu Val Val Val He Val Ala Val He 
1105 1110 1115 1120 

Cys Thr Arg Arg Ser Ser Ala Gin Gin Arg Lys Lys Arg Ala Thr His 
1125 1130 1135 

Ser Ala Gly Lys Arg Lys Gly Ser Gin Lys Asp Leu Arg Pro Pro Asp 
1140 1145 1150 

Leu Trp He His His Glu Glu Met Glu Met Lys Asn He Glu Lys Pro 
1155 1160 1165 

Ser Gly Thr Asp Pro Ala Gly Arg Asp Ser Pro He Gin Ser Cys Gin 
1170 1175 1180 

Asp Leu Thr Pro Val Ser His Ser Gin Ser Glu Thr Gin Leu Gly Ser 
1185 1190 1195 1200 

Lys Ser Thr Ser His Ser Gly Gin Asp Thr Glu Glu Ala Gly Ser Ser 
1205 1210 1215 

Met Ser Thr Leu Glu Arg Ser Leu Ala Ala Arg Arg Ala Pro Arg Ala 
1220 1225 1230 



Lys Leu 



Met He Pro Met Asp Ala Gin Ser Asn Asn Pro Ala Val Val 
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1235 1240 1245 

Ser Ala lie Pro Val Pro Thr Leu Glu Ser Aia Gin Tyr Pro Gly lie 
1250 1255 1260 

Leu Pro Ser Pro Thr Cys Gly Tyr Pro His Pro Gin Phe Thr Leu Arg 
1265 1270 1275 1280 

Pro Val Pro Phe Pro Thr Leu Ser Val Asp Arg Gly Phe Gly Ala Gly 
1285 1290 1295 

Arg Ser Gin Ser Val Ser Glu Gly Pro Thr Thr Gin Gin Pro Pro Met 
1300 1305 1310 

Leu Pro Pro Ser Gin Pro Glu His Ser Ser Ser Glu Glu Ala Pro Ser 
1315 1320 1325 

Arg Thr lie Pro Thr Ala Cys Val Arg Pro Thr His Pro Leu Arg Ser 
1330 1335 1340 

Phe Ala Asn Pro Leu Leu Pro Pro Pro Met Ser Ala lie Glu Pro Lys 
1345 1350 1355 1360 

Val Pro Tyr Thr Pro Leu Leu Ser Gin Pro Gly Pro Thr Leu Pro Lys 
1365 1370 1375 

Thr His Val Lys Thr Ala Ser Leu Gly Leu Ala Gly Lys Ala Arg Ser 
1380 1385 1390 

Pro Leu Leu Pro Val Ser Val Pro Thr Ala Pro Glu Val Ser Glu Glu 
1395 1400 1405 

Ser His Lys Pro Thr Glu Asp Ser Ala Asn Val Tyr Glu Gin Asp Asp 
1410 1415 1420 

Leu Ser Glu Gin Met Ala Ser Leu Glu Gly Leu Met Lys Gin Leu Asn 
1425 1430 1435 1440 

Ala He Thr Gly Ser Ala Phe 
1445 
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atggagaata gtcttagatg tgtttgggta 

tccttgctca gcgcgcatct tcaagtaacc 

cgcttcctct cagaaccttc tgatgccgtc 

tgctccgcgg agrccgaccg aggagttcca 

ctggccttgg gaatggatga aaggaagcag 

aacatacttc attccagaca ccacaagcca 

ttaggagatt ctggctcaat tattagtcgg 

aggttccttt cacagacaga atctgtcaca 

tgtgaagtca ttggggagcc catgccaaca 

actccaatcc caggtgactc ccgagtggtg 



cccaagctgg cttttgtact cttcggagct 60 

ggttttcaaa ttaaagcttt cacagcactg 120 

acaatgcggg gaggaaatgt cctcctcgac 180 

gtgatcaagt ggaagaaaga tggcattcat 240 

caactttcaa atgggtctct gctgatacaa 300 

gatgagggac tttaccaatg tgaggcatct 360 

acagcaaaag ttgcagtagc aggaccactg 420 

gccttcatgg gagacacagt gctactcaag 480 

atccactggc agaagaacca acaagacctg 540 

gtcttgccct ctggagcatt gcagatcagc 600 
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cgactccaac 
agaacaggaa 
tattttctgc 
tgttgtgttt 
caactcaggt 
acagatgatg 
gcctctgcag 
tatgcctatg 
actg tgaat t 
ggaggaagca 
gtggctgaaa 
gcaatcccaa 
tccagccgat 
actttcacgg 
cagcctgggt 
cgagttgtgg 
acacagcctg 
cctacctcaa 
tacagattgt 
ctatcttata 
tataatcgct 
gtgccaagtg 
gttagctggc 
cgacacagaa 
tacctattca 
gtcaatggta 
gatgaatctc 
atcatcatga 
ggttatggcg 
tattccattg 
aa tgccggag 
actgacccag 
tccaccccca 
agggtcagct 
tacaccgtcc 
acatctctaa 
atggtaacaa 
gaagcagccc 
cgtgccgtca 
atcttatttt 
agtggtgata 
cgaattcaag 
actctgaaag 
ggttattggc 
ggacaaatgc 
atcattgtgg 
tgcacccgac 
aggaagggca 
gagatgaaaa 
caaagttgcc 
aaaagcacct 
gagaggtcgc 
cagtccaaca 
tacccaggaa 
cctgtgccat 
gtgagtgaag 
tctagcagcg 
ccactccgca 



cgggggacat 
atgaagcaga 
aaagaccatc 
ctggctatcc 
ctaaaaagta 
acagtggaa t 
agctcacagt 
aaagcatgga 
gga tgaagaa 
act tacggat 
acgaggctgg 
gctccagtgt 
ttgtccgtct 
tctttttctc 
cccttcagct 
cttacaatga 
agttgcaagt 
ttcttattac 
tctgcactga 
aactggaagg 
atggtccggg 
ccccgcctca 
tgcctcctcc 
agacgacccg 
caggactgga 
ctggaccacc 
aagttcctga 
gttggactcc 
ttgggagccc 
agaggttaga 
aaggagttcc 
ttgattatta 
tgctcccacc 
gggcagacaa 
ggtggagaac 
gttacacagc 
aaaacagaag 
ccacctctgc 
ttgtgagttg 
ataccttgga 
ggcttactca 
cacgaaattc 
tggaacaccc 
cagttgatac 
accccccgca 
tcaccgttgg 
gctcttcagc 
gccagaagga 
atattgaaaa 
aagacctcac 
ctcattcagg 
tggctgcacg 
atcctgctgt 
tcctcccgtc 
tcccaacact 
gaccaactac 
aggaggcacc 
gctttgctaa 



cggaatttac 

agtcagaatt 

caatgtagta 

tccaccaagt 

ttctttattg 

gta tacctgt 

cttggttccg 

tattgagttt 

tggagatgug 

acttggggtg 

aaatgcccag 

cctcccttcg 

cagctggcgc 

cagagaaggt 

cactgtggga 

atggggaccg 

tccagggcca 

ctgggaaccc 

ggtgtccaca 

cctgaaaaaa 

cgtctctact 

gaacgtctcc 

atcaggaaca 

caggggtgag 

gaaaggaagt 

ttccaactgg 

tcaaccaagc 

tcccttgaac 

ttacgctgag 

gtcaagttcc 

tctttatgaa 

tcctttgctt 

agtaggtgta 

ctctgtccct 

cagcttttct 

aacaggcctc 

gtccagtact 

tcccaaggac 

gcagcctccc 

caagaacatc 

tcaaatcatg 

aaaaggagtg 

tgacaaaatg 

taatttgatt 

tggcagtgtc 

tgtcatcaca 

ccagcagaga 

cctccgaccc 

gccatctggc 

accagtcagc 

tcaagacact 

ccgagccccc 

cgtgagcgcc 

tcccacctgt 

ctcagtggac 

ccaacaacca 

aagcagaacc 

tcctttgcta 
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cgatgcucag 
ttatcaga tc 
gccat tgaag 
ttracctggt 
ggtggaagca 
gttgtcacat 
ccatggtttt 
gaatgtacag 
gtcattccca 
gtgaagtcag 
accagtgcac 
gctcccagag 
ccacctgcag 
gacaacaggg 
aacctgaagc 
ggagagagtt 
gtagaaaacc 
cctgcctatg 
ggaaaagaac 
ttcaccgaat 
gatgatataa 
ctggaagtgg 
caaaatggat 
atggaaacac 
cagtacagtt 
tatactgcag 
tctcttcatg 
ccaaacatcg 
acagtgcgtg 
cattatgtaa 
agtgccacca 
gatgatttcc 
caggctgtgg 
aagaaccaaa 
gcaagtgcaa 
aaaccaaaca 
tggagcatga 
tttacagtca 
ttggaagcca 
ccaattgatg 
gatctcaacc 
gggccactct 
gctaatgacc 
gatagaagca 
actcctcaga 
gtgctggtag 
aagaaacggg 
cctgatcttt 
actgaccctg 
cacagccagt 
gaggaagcag 
cgggccaagc 
atcccggtgc 
ggatatcccc 
cgaggtttcg 
cctatgctgc 
atccccacag 
cctccaccaa 



ctcgaaatcc 

caggactgca 

gaaaaga tgc 

tacgaggcga 

acttgcttat 

ataaaaatga 

taaatcatcc 

tctctggaaa 

gtgattattt 

atgaaggctt 

agctcattgt 

atgtggtccc 

aagcgaaagg 

aacgagcatt 

cagaagccat 

ctcaacccat 

tgcaagctgt 

caaacggtcc 

agaatataga 

atagtcttcg 

cagtggttac 

tcaattcaag 

ttattaccgg 

tggagccaaa 

tccaggtgtc 

agactccaga 

tgaggcccca 

tggtgcgagg 

tggacagcaa 

tctccctaaa 

ccaggtctat 

ccacctcggt 

ctcttaccca 

agacgtctga 

aatacaagtc 

caatgtatga 

ctgcacatgc 

ttactaggga 

atgggaaaat 

actggattat 

ttgatactat 

ctgatcccat 

aaggtcgtca 

ccctaaatga 

agaacagcaa 

tggtcatcgt 

ccacccacag 

ggatccatca 

caggaaggga 

cagaaaccca 

ggagctctat 

tcatgattcc 

caacgctaga 

acccgcagtt 

gagcaggaag 

ccccatctca 

cttgtgttcg 

tgagtgcaat 
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agccagctca 66C 
tagacagctg 72C 
rgtcctggaa 78C 
ggaagtcatc 840 
ctccaatgtg 900 
gaatattagt 96C 
ttccaacctg 1020 
gcctgtgccc 1080 
tcagatagcg 1140 
ttatcaatgt 1200 
ccctaagcct 1260 
tgtcttggtt 1320 
gaacattcaa 1380 
gaatacaaca 1440 
gtacaccttt 1500 
caaggtggcc 1560 
atctacctca 1620 
agtccaaggt 1680 
ggttgatgga 1740 
attcttagct 1800 
actttccgac 1860 
aagtatcaaa 1920 
ctataaaatt 1980 
caacctctgg 2O40 
agccatgaca 2100 
gaatgatcta 2160 
gactaactgc 2220 
ttatattatc 2280 
gcagcgatat 2340 
agcttttaac 2400 
aaccgatccc 2460 
cccagatctc 2520 
tgatgctgtg 2580 
ggtgcgactt 2640 
agaagacaca 2700 
attctcggtc 2760 
caccacgtat 2820 
agggaagcct 2880 
tactgcttac 2940 
ggaaacaatc 3000 
gtattacttt 3060 
cctcttcagg 3120 
tggagatgga 3180 
gccgccaatt 3240 
cctgcttgtg 3300 
ggctgtgatt 3360 
tgctggcaaa 3420 
tgaagaaatg 3480 
ctctcccatc 3540 
actgggaagc 3600 
gtccactctg 3660 
catggatgcc 3720 
aagtgcccag 3780 
cactctccgg 3840 
aagtcagtca 3900 
gcctgagcat 3960 
accaactcac 4020 
agaaccgaaa 4080 
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because they relate to subject matter not required to be searched by this Authority, namely: 



2. I I Claims Nos,: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. I I Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box 11 Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 

1 . I I As ail required additional search fees were timely paid by the applicant, this International Search Report covers all 
I ' searchable claims. 
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of any additional fee. 
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This International Searching Authority found multiple (groups of) 

inventions in this international application, as follows: 

1. Claims: 1-18 (totally) and 19-28,30-59,61-64,66-67, 

69 (all partially) 



1.1. Claims: 1-6 (totally) and 19-28,30-59,61-64,66-67, 
69 (all partially) 

A human Unc-5Cb protein (SEQ ID 2), nucleic acids 
encoding it (SEQ ID 1), vectors and cells expressing 
it and an antibody binding thereto. Methods for 
identifying compounds which are capable of modulating 
the binding of Unc-5Cb to an interacting protein. 



1.2. Claims: 7-12,71,85 (totally) and 19-28,30-59,51-64, 

66-67,69 (all partially) 

As for subject 1.1, but concerning a human Unc-5Cc 
protein (SEQ ID 4) . 



1.3. Claims: 13-18 (totally) and 19-28,30-59,61-64,66-67, 
69 (all partially) 

As for subject 1.1, but concerning a human Unc-5C8 
protein (SEQ ID 6). 



2. Claims: 19,29-58 (all partially) 

A method, as characterised in claim 19, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



3. Claims: 20,29-58 (all partially) 

A method, as characterised in claim 20, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



4. Claims: 21,29-58 (all partially) 

A method, as characterised in claim 21, for identifying 

compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



5. Claims: 22,29-58 (all partially) 
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A method, as characterised in claim 22, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



6. Claims: 23-58 (all partially) 

A method, as characterised in claim 23, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



7. Claims: 59,61-64,66-67,69 (all partially) and claims 60,65, 
68 (totally) 



A method for identifying compounds reducing or inhibiting 
the lethal phenotype associated with the expression of an 
UNC-5 death domain in yeast. 



8. Claims: 70,80-84 (totally) and 19-28,30-59,61-64,66-67, 
69 (all partially) 



A nucleic acid encoding the human unc-5Hl homolog (SEQ ID 
7), probes and anti sense nucleic acids hybridizing 
therewith, vectors and cells comprising it. Methods for 
identifying compounds which are capable of modulating the 
binding of Unc-5H1 to an interacting protein. 



9. Claims: 72-73 (totally) and 19-31,53 (all partially) 

A nucleic acid obtainable by digestion of pYMP17 with EcoRI 
and Xhol (SEQ ID 56). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 55. 



10. Claims: 74-75 (totally) and 19-31,54 (all partially) 

A nucleic acid obtainable by digestion of pYMP6 with EcoRI 
and Xhol (SEQ ID 54). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 54. 



11. Claims: 76-77 (totally) and 19-31,57 (all partially) 

A nucleic acid obtainable by digestion of pYMPU with EcoRI 
and Xhol (SEQ ID 61). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 61. 
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12. Claims: 78-79 (totally) and 19-31,58 (all partially) 

A nucleic acid obtainable by digestion of pYMP12 with EcoRI 
and Xhol (SEQ ID 63). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 63. 
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necessarily linked by a common inventive concept, could be searched 
without effort justifying an additional fee. 
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